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Abstract 

"Symmetry" was one of the most important methodological themes 
in 20th-century physics and is probably going to play no lesser role 
in physics of the 21st century. As used today, there are a variety of 
interpretations of this term, which differ in meaning as well as their 
mathematical consequences. Symmetries of crystals, for example, gen- 
erally express a different kind of invariance than gauge symmetries, 
though in specific situations the distinctions may become quite subtle. 
I will review some of the various notions of "symmetry" and highlight 
some of their uses in specific examples taken from Pauli's scientific 
cevre. 

This paper is based on a talk given at the conference Wolfgang 
Pauli's Philosophical Ideas and Contemporary Science, May 20.-25. 
2007, at Monte Verita, Ascona, Switzerland. 



1 



Contents 



1 General Introduction 

2 Remarks on the notion of symmetry 

2.1 Spacetime 

2.2 Dynamical symmetries versus covariance 

2.3 Observable versus gauge symmetries 

3 Specific comments on symmetries in Pauli's work 

3.1 The hydrogen atom in matrix mechanics 

3.2 Particles as representations of spacetime automorphisms . . . 

3.3 Spin and statistics 

3.4 The meaning of 'general covariance' 

3.5 General covariance and antimatter 

3.6 Missed opportunities 

3.6.1 Supersymmetry 

3.6.2 Kaluza-Klein Monopoles 

3.7 Irritations and psychological prejudices 

3.8 (3-Decay and related issues 

3.8.1 CPT 

3.8.2 The Pauli group 

3.8.3 Cosmological speculations 

4 Conclusion 
Acknowledgements 
References 



2 



1 General Introduction 



In the Introduction to Pauli's Collected Scientific Papers, the editors, Ralph 
Kronig and Victor Weisskopf, make the following statement: 

It is always hard to look for a leading principle in the work of a 
great man, in particular if his work covers all fundamental prob- 
lems of physics. Pauli's work has one common denominator: his 
striving for symmetry and invariance. [...] The tendency towards 
invariant formulations of physical laws, initiated by Einstein, has 
become the style of theoretical physics in our days, upheld and de- 
veloped by Pauli during all his life by example, stimulation, and 
criticism. For Pauli, the invariants in physics where the symbols 
of ultimate truth which must be attained by penetrating through 
the accidental details of things. The search for symmetry and 
general validity transcend the limits of physics in Pauli 's work; 
it penetrated his thinking and striving throughout all phases of 
his life, in all fields of philosophy and psychology. "(|38j. Vol.1, 
p. viii) 

Indeed, if I were asked to list those of Pauli's scientific contributions which 
make essential use of symmetry concepts and applied group theory, I would 
certainly include the following, which form a substantial part of Pauli's 
scientific cevreQ 

Relativity theory and Weyl's extension thereof (1918-1921), the 
Hydrogen atom in matrix mechanics (1925), exclusion principle 
(1925), anomalous Zeeman effect and electron spin (1925), non- 
relativistic wave-equation for spinning electron (1927), covari- 
ant QED (1928, Jordan), neutrino hypothesis (1930), Kaluza- 
Klein theory and its projective formulation (1933), theory of y- 
matrices (1935), Poincare-invariant wave equations (1939, Fierz), 
general particle statistics and Lorentz invariance (1940, Belin- 
fante), spin-statistics (1940), once more General Relativity and 
Kaluza-Klein theory (1943, Einstein), meson-nucleon interaction 
and differential geometry (1953), CPT theorem (1955), (3-decay 
and conservation of lepton charge ('Pauli group', 1957), unifying 
non-linear spinor equation (collaboration with Heisenberg, 1957- 
58), group structure of elementary particles (1958, Touschek). 

Amongst the theoretical physicists of his generation, Pauli was certainly 
outstanding in his clear grasp of mathematical notions and methods. He had 

Two of the listed themes, "meson-nucleon interaction and differential geometry" and 
"unifying non-linear spinor equation", were never published in scientific journals (in 
the second case Heisenberg published for himself without Pauli's consent) but can be 
followed from his letters and manuscripts as presented in |45] . 
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a particularly sober judgement of their powers as well as their limitations 
in applications to physics and other sciences. Let us once more cite Kronig 
and Weisskopf: 

Pauli 's works are distinguished by their mathematical rigour and 
by a thorough and honest appraisal of the validity of assumptions 
and conclusions. He was a true disciple of Sommerfeld in his 
clear mathematical craftsmanship. By example and sharp criti- 
cism he constantly tried to maintain a similarly high standard in 
the work of other theoretical physicists. He was often called the 
living conscience of theoretical physicists. ([38]) Vol. 1, p.viii) 

It seems plausible that this critical impregnation dates back to his school- 
days, when young Pauli read, for example, Ernst Mach's critical analysis 
of the historical development of the science of mechanics, a copy of which 
Pauli received as a present from his Godfather (Mach) at around the age 
of fourteen. Mach's "Mechanik" , as this book is commonly called, starts 
out with a discussion of Archimedes' law of the lever, thereby criticising 
the following symmetry consideration ([13], p. 11-12): Imagine two equal 
masses, M, and a perfectly stiff and homogeneous rod of length L, both 
being immersed into a static homogeneous vertical gravitational field, where 
the rod is suspended at its midpoint, m, from a point p above; see Fig.[TJ 
What happens if we attach the two equal masses to the ends of the rod 
and release them simultaneously without initial velocity? An immediate 
symmetry argument suggests that it stays horizontal; it might be given as 
follows: Everything just depends on the initial geometry and distribution of 
masses, which is preserved by a reflection at the plane perpendicular to the 
rod through p and m. Suppose that after release the rod dropped at one side 
of the suspension point m, then the mirror image of that process would have 
the same initial condition with the rod dropping to the other side. This is a 
contradiction if the laws governing the process are assumed to be reflection 
symmetric and deterministic (unique outcome for given initial condition). 
This argument seems rigorous and correct. Now, how does one get from 
here to the law of the lever? The argument criticised by Mach is as follows: 
Assume that the condition for equilibrium depends only on the amount of 
mass and its suspension point on the rod, but not on its shape. Then we 
may replace the mass to the left of m by two masses of half the amount each 
on a small rod in equilibrium, as shown in the second (upper right) picture. 
Then replace the suspension of the small rod by two strings attached to the 
left arm of the original rod, as shown in the third (lower left) picture, and 
observe that the right one is just under the suspension point m, so that it 
does not disturb the equilibrium if it were cut away as in the last (lower right) 
picture of Fig.[TJ The weak point in the argument is clearly the transition 
from the second to the third picture: There is no global symmetry connecting 
them, even though locally, i.e. regarding the small rod only, it connects two 
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Figure 1: The law of the lever 'derived' from alleged symmetry consider- 
ations. The step from the upper right (second) to the lower left (third) 
picture does not follow. The small (blue) balls represent half the mass of 
the big (red) balls. 



equilibrium positions. It is easy to see that, in fact, the assumption that a 
global equilibrium is maintained in this change is equivalent to Archimedes' 
law of the lever. This example shows (in admittedly a fairly trivial fashion) 
that alleged symmetry properties can work as a petitio principii for the law 
to be derived. This is essentially the criticism of Mach. 

The reason why we consider this 'derivation' of the law of the lever to be 
a petitio principii is that we have other, physically much more direct ways to 
actually derive it from dynamical first principles. From that point of view the 
alleged symmetry is to be regarded as an artifact of the particular law and 
certainly not vice versa. The observed symmetry requires an explanation 
in terms of the dynamical laws, which themselves are to be established in 
an independent fashion. This is how we look upon, say, the symmetry of 
crystals or the symmetric shape of planetary orbits. 

On the other hand, all fundamental dynamical theories of 20th cen- 
tury physics are motivated by symmetry requirements. They are commonly 
looked at as particularly simple realisations of the symmetries in question, 
given certain a priori assumptions. It is clear that, compared to the previous 
example, there are different concepts of symmetry invoked here. However, 
there also seems to be a shift in attitude towards a more abstract under- 
standing of 'physical laws' in general. 

What makes Pauli an interesting figure in this context is that this shift 
in attitude can be traced in his own writings. Consider Special Relativity as 
an example, thereby neglecting gravity. One may ask: What is the general 
relation between the particular symmetry (encoded by the Poincare group) 
of spacetime and that very same symmetry of the fundamental interactions 
(weak, strong, and electromagnetic, but not gravity)? Is one to be considered 
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as logically prior to the other? For example, if we take Einstein's original 
operationalist attitude, we would say that the geometry of spacetime is 
defined through the behaviour of 'rods' and 'clocks', which eventually should 
be thought of as physical systems obeying the fundamental dynamical laws. 
In fact, Einstein often complained about the fact that rods and clocks are 
introduced as if they were logically independent of the dynamical laws, e.g., 
in a discussion remark at the 86th meeting of the Gesellschaft Deutscher 
Naturforscher und Arzte in Bad Nauheim in 19200 

It is a logical shortcoming of the Theory of Relativity in its 
present form to be forced to introduce measuring rods and clocks 
separately instead of being able to construct them as solutions to 
differential equations. ([66], Vol.7, Doc. 46, p. 353) 

From that viewpoint, symmetry properties of spacetime are nothing but an 
effective codification of the symmetries of the fundamental laws. Conse- 
quences like 'length contraction' and 'time dilation' in Special Relativity are 
then only effectively described as due to the geometry of spacetime, whereas 
a fundamental explanation clearly has to refer to the dynamical laws that 
govern clocks and rods. This was clearly the attitude taken by H.A. Lorentz 
and H. Poincare, though in their case still somehow afflicted with the idea 
of a material asther that, in principle, defines a preferred rest frame, so that 
the apparent validity of the principle of relativity must be interpreted as due 
to a 'dynamical conspiracy' o In his famous article on Relativity for the En- 
cyclopedia of Mathematical Sciences, the young Pauli proposes to maintain 
this view, albeit without the idea on a material aether. He writesQ 

Should one, then, in view of the above remarks, completely aban- 
don any attempt to explain the Lorentz contraction atomistically? 
We think that the answer to this question should be No. The con- 
traction of a measuring rod is not an elementary but a very com- 
plicated process. It would not take place except for the covariance 

2 German original: "Es ist eine logische Schwache der Relativitatstheorie in ihrem heuti- 
gen Zustande, dafi sie Mafistabe und Uhren gesondert einfiihren mufi, statt sie also 
Losungen von Differentialgleichungen konstruieren zu konnen." 

3 H.A. Lorentz still expressed this viewpoint well after the formulation of Special Rela- 
tivity, for example in [4TJ, p. 23. 

4 German original: "Ist aber das Bestreben, die Lorentz-Kontraktion atomistisch zu ver- 
stehen, vollkommen zu verwerfen? Wir glauben diese Frage verneinen zu miissen. Die 
Kontraktion des Mafistabes ist kein elementarer, sondern ein sehr verwickelter Prozcfi. 
Sie wurde nicht eintreten, wenn nicht schon die Grundgleichungen der Elektronentheo- 
rie sowie die uns noch unbekannten Gesetze, welche den Zusammenhalt des Elektrons 
selbst bestimmen, gegeniiber der Lorentz-Gruppe kovariant waren. Wir miissen eben 
postulieren, daB dies der Fall ist, wissen aber auch, dafi dann, wenn dies zutrifft, die 
Theorie imstande sein wird, das Verhalten von bewegten Mafistaben und Uhren atom- 
istisch zu erklaxen." ( 58 , p. 30.) 
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with respect to the Lorentz group of the basic equations of elec- 
tron theory, as well as those laws, as yet unknown to us, which 
determine the cohesion of the electron itself. We can only postu- 
late that this is so, knowing that then the theory will be capable of 
explaining atomistically the behaviour of moving measuring rods 
and clocks." (p3], p. 15.) 

Very recently, this traditional view has once more been defended under the 
name of 'Physical Relativity' [7] against todays more popular view, accord- 
ing to which Special Relativity is about the symmetry properties of space- 
time itself. Clearly, the latter view only makes sense if spacetime is endowed 
with its own ontological status, independently of the presence of rods and 
clocks. 

This shift in emphasis towards a more abstract point of view is also re- 
flected in Pauli's writings, for example in the Preface to the English edition 
of his 'Theory of Relativity' of 1956, where the abstract group-theoretic 
properties of dynamical laws are given an autonomous status in the expla- 
nation of phenomena: 

The concept of the state of motion of the 'luminiferous tether', 
as the hypothetical medium was called earlier, had to be given up, 
not only because it turned out to be unobservable, but because it 
became superfluous as an element of a mathematical formalism, 
the group-theoretical properties of which would only be disturbed 
by it. By the widening of the transformation group in general 
relativity the idea of a distinguished inertial coordinate system 
could also be eliminated by Einstein, being inconsistent with the 
group-theoretical properties of the theory. 

Pushed to an extreme, this attitude results in the belief that the most 
fundamental laws of physics are nothing but realisations of basic symme- 
tries. Usually this is further qualified by adding that these realisations are 
the most 'simple' ones, at least with respect to some intuitive measure of 
simplicity. Such statements are well known from Einstein's later scientific 
period and also from Heisenberg in connection with his 'unified theory' of 
elementary particles, for which he proposed a single non-linear differential 
equation, whose structure was almost entirely motivated by its symmetry 
properties. Heisenberg made this point quite explicitly in his talk entitled 
Planck's discovery and the foundational issues of atomism^, delivered dur- 
ing the celebrations of Max Planck's 100th anniversary — at which occasion 
Wolfgang Pauli received the Max-Planck medal in absentia — , where he also 

5 German original: "Die Plancksche Entdeckung und die philosophischen Grundfragen 
der Atomlehre". 
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talked about his own 'unified theory' H 

The mentioned equation contains, next to the three natural units 
[c, h, I], merely mathematical symmetry requirements. These 
requirements seem to determine everything else. In fact, one 
should just regard this equation as a particularly simple repre- 
sentation of the symmetry requirements, which form the actual 
core of the theory. 

Pauli, who briefly collaborated with Heisenberg on this project, did not at 
all share Heisenberg's optimism that a consistent quantum-field theory could 
be based on Heisenberg's non-linear field equation. His objections concerned 
several serious technical aspects, overlayed with an increasing overall dislike 
of Heisenberg's readiness to make premature claims, particularly when made 
publicly. 

However, I think it is fair to say that the overall attitude regarding the 
heuristic role and power of symmetry principles in fundamental physics, 
expressed by Heisenberg in the above quote, was also to a large extent 
shared by Pauli, not only in his later scientific life. This is particularly 
true for symmetry induced conservation laws, towards which Pauli had very 
strong feelings indeed. Examples from his later years will be discussed in 
later sections (e.g. Sect. 13. 7]) . An example from his early scientific life is 
his strong resistance against giving up energy-momentum conservation for 
individual elementary processes, while keeping it on the statistical average. 
Such ideas were advocated in the "new radiation theory" of Bohr, Kramers, 
and Slater of early 1924 [5] and again by Bohr in connection with (3-decay, 
which Pauli called spiritual somersaults in a letter to Max Delbriick. A week 
after his famous letter suggesting the existence of the neutrino, Pauli wrote 
to Oskar Klein in a letter dated Dec. 12th 19300 

6 German original: "Die erwahnte Gleichung enthalt neben den drei natiirlichen Mafiein- 
heiten nur noch mathematische Symmetrieforderungen. Durch diese Forderungen 
scheint alles weitere bestimmt zu sein. Man mufi eigentlich die Gleichung nur als eine 
besonders einfache Darstellung der Symmetrieforderungen, aber diese Forderung als den 
eigentlichen Kern der Theorie betrachten." (g5], Vol. IV, PartlVB, p. 1168) 

7 German original: "Erstens scheint es mir, dafi der Erhaltungssatz fur Energie-Impuls 
dem fur die Ladung doch sehr weitgehend analog ist und ich kann keinen theoretischen 
Grund dafiir sehen, warum letzterer noch gelten sollte (wie wir es ja empirisch fur 
den |3-Zerfall wissen), wenn ersterer versagt. Zweitens miifite bei einer Verletzung des 
Energiesatzes auch mit dem Gewicht etwas sehr merkwiirdiges passieren. [...] Dies 
widerstrebt meinem physikalischen Gefiihl auf das aufierste! Denn es mufi dann sogar 
auch fiir das Gravitationsfeld, das von dem ganzen Kasten (samt seinem radioaktiven 
Inhalt) selber erzeugt wird (...), angenommen werden, dafi es sich andern kann, wahrend 
wegen der Erhaltung der Ladung das nach aufien erzeugte elektrostatische Feld (beide 
Felder scheinen mir doch analog zu sein; das wirst Du ja iibrigens auch aus deiner 
fiinfdimensionalen Vergangenheit noch wissen) unverandert bleiben soil." ([45], Vol.11, 
Doc. [261], p. 45-46) 
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First it seems to me, that the conservation law for energy- 
momentum is largely analogous to that for electric charge, and I 
cannot see a theoretical reason why the latter should still be valid 
(as we know empirically from fi-decay) if the former fails. Sec- 
ondly, something strange should happen to the weight if energy 
conservation fails. [...J This contradicts my physical intuition 
to an extreme! For then one has to even assume that the grav- 
itational field produced [...] by the box (including the radioac- 
tive content) can change, whereas the electrostatic field must re- 
main unchanged due to charge conservation (both fields seem to 
me analogous; as you will remember from your five- dimensional 
past). 

This is a truly remarkable statement. Not many physicists would nowadays 
dare suggesting such an intimate connection between the conservation laws 
of charge and energy-momentum. What Pauli hints at with his last remarks 
in brackets is the Kaluza-Klein picture, in which electric charge is inter- 
preted as momentum in an additional space dimension in a five-dimensional 
spacetime. 

It is not difficult to find explicit commitments from Pauli's later scientific 
life expressing his belief in the heuristic power of symmetry considerations. 
Let me just select two of them. The first is from his introduction to the 
International Congress of Philosophers, held in Zurich in 1954, where Pauli 
statesH 

"It seems likely to me, that the reach of the mathematical group 
concept in physics is not yet fully exploited. " 

The second is from his closing remarks as the president of the conference 
"50 Years of Relativity" held in Berne in 1955, where with respect to the 
still unsolved problem of whether and how the gravitational field should be 
described in the framework of Quantum-Field-Theory he remarks^ 

It seems to me, that the heart of the matter [the problem of quan- 
tising the gravitational field] is not so much the linearity or non- 
linearity, but rather the fact that there is present a more general 
group than the Lorentz group. 

This, in fact, implicitly relates to much of the present-day research that is 
concerned with that difficult problem. 

8 German original: "Es ist mir wahrscheinlich, dass die Tragweite des mathematischen 
Gruppenbegriffes in der Physik heute noch nicht ausgeschopft ist." ([38], Vol. 2, p. 1345) 

9 German original: "Es scheint mir also, daB nicht so sehr die Linearitat oder Nichtlin- 
earitat der Kern der Sache ist, sondern eher der Umstand, dafi hier eine allgemeinere 
Gruppe als die Lorentzgruppe vorhanden ist."(;38], Vol.2, p. 1306) 
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Before we can discuss specific aspects of 'symmetry' in Pauli's work in 
Section[3l we wish to recall various aspects of symmetry principles as used 
in physics. 

2 Remarks on the notion of symmetry 
2.1 Spacetime 

The term 'symmetry' is used in such a variety of meanings, even in physics, 
that it seems appropriate to recall some of its its main aspects. One aspect 
is that which mathematicians call an 'automorphism' and which basically 
means a 'structure preserving self-map'. Take as an example (conceptually 
not an easy one) the modern notion of spacetime. First of all it is a set, 
M, the members of which are events, or better, 'potential events', since we 
do not want to assume that every spacetime point to be an actual physical 
event in the sense that a material happening is taking place, or at least 
not one which is dynamically relevant to the problem at hand0 That set 
is endowed with certain structures which are usually motivated through 
operational relations of actual physical events. 

One such structure could be that of a preferred set of paths, which repre- 
sent inertial (i.e. force free) motions of 'test bodies', that is, localised objects 
which do not react back onto spacetime structure. This defines a so-called 
'path-structure' (compare [E][l0]), which in the simplest case reduces to an 
affine structure in which the preferred paths behave, intuitively speaking, 
like 'straight lines'. This can clearly be said in a much more precise form 
(see, e.g., [60J). Under very mild technical assumptions (not even involving 
continuity) one may then show that the only automorphisms of that 'iner- 
tial structure' can already be narrowed down to the inhomogeneous Galilei 
or Lorentz groups, possibly supplemented by constant scale transformations 
(cf. [2][28])E±] 

Another structure to start with could have been that of a causal relation 
on M. That is, a partial order relation which determines the pairs of points 
on spacetime which, in principle, could influence each other in form of a 
propagation process based on ordinary matter or light signals. The auto- 
morphism group of that structure is then the subgroup of bijections on M 
that, together with their inverse, preserve this order relation. For example, 
in case of Minkowski space, where the causal relation is determined by the 

10 Minkowski was well aware that empty domains of spacetime may cause conceptual 
problems. Therefore, in his famous 1908 Cologne address Space and Time (German 
original: "Raum und Zeit"), he said: In order to not leave a yawning void, we wish 
to imagine that at every place and at every time something perceivable exists. German 
original: "Um nirgends eine gahnende Leere zu lassen, wollen wir uns vorstellen, dafi 
allerorten und zu jeder Zeit etwas Wahrnehmbares vorhanden ist". ([47], p. 2) 
We shall from now on use 'Poincare group' for 'inhomogeneous Lorentz group' and 
'Lorentz group' for 'homogeneous Lorentz group'. 
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light-cone structure, it may be shown that the most general automorphism 
is given by a Poincare transformation plus a constant rescalingfT] |73j. Since, 
according to Klein's Erlanger Programm [36], any geometry may be charac- 
terised by its automorphism group, the geometry of Minkowski space is, up 
to constant rescalings, entirely encoded in the causal relations. 

The same result can be arrived at through topological considerations. 
Observers (idealised to be extensionless) move in spacetime on timelike 
curves. Take the set C of all (not necessarily smooth) timelike curves which 
are continuous in the standard (Euclidean) topology 7e of Minkowski space- 
time M. Now endow M with a new topology, 7p, called the path topology, 
which is the finest topology on M which induces the same topology on each 
path in C as the standard (Euclidean) topology 7^. The new topology 7p 
is strictly finer than 7^ and has the following remarkable property: The 
automorphism group of (M,7pJ0, i.e. the group of bijections of M which, 
together with their inverses, preserve 7p, is just the Poincare group extended 
by the constant rescalings [32J. This is possibly the closest operational mean- 
ing one could attribute to the topology of spacetime, since in 7p a set in 
spacetime is open if and only if every observer "times" it to be open. 

All this is meant to illustrate that there are apparently different ways 
to endow spacetime with structures that are, physically speaking, more or 
less well motivated and which lead to the same automorphism group. That 
group may then be called the group of spacetime symmetries. So far, this 
group seems to bear no direct relation to any dynamical law. However, the 
physical meaning of such statements of symmetry is tight to an ontological 
status of spacetime points. We assumed from the onset that spacetime is 
a set M. Now, recall that Georg Cantor, in his first article on transfinite 
set-theory [8], started out with the following definition of a set 1^1 

By a 'set' we understand any gathering-together M of deter- 
mined well- distinguished objects m of our intuition or of our 
thinking (which are called the 'elements' of M.) into a whole. 

Hence we may ask: Is a point in spacetime, a 'potential event' as we called 
it earlier, a "determined well-distinguished object of our intuition or of our 
thinking" ? This question is justified even though modern axiomatic set the- 
ory is more restrictive in what may be called a set (for otherwise it runs 
into the infamous antinomies) and also stands back from any characterisa- 
tion of elements in order to not confuse the axioms themselves with their 

12 In the standard topological way of speaking this is just the 'homeomorphism group' of 
(M.Tp). 

13 German original: "Unter einer 'Menge' verstehen wir jede Zusammenfassung M von bes- 
timmten wohlunterschiedenen Objecten m unserer Anschauung oder unseres Denkens 
(welche die 'Elemente' von M. genannt werden) zu einem Ganzen." ([8], p. 481) 
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possible interpretations]^ However, applications to physics require inter- 
preted axioms, where it remains true that elements of sets are thought of 
as definite as in Cantors original definition. But it is just this definiteness 
that seems to be physically unwarranted in application to spacetime. The 
modern general-relativistic viewpoint takes that into account by a quotient 
construction, admitting only those statements as physically meaningful that 
are invariant under the group of (differentiable) permutations of spacetime 
points. This is possible only because all other structures on spacetime, in 
particular the metric and with it the causal structure, are not fixed once and 
for all but are subsumed into the dynamical fields. Hence no non-dynamical 
background structures remain, except those that are inherent in the defini- 
tion of a differentiable manifold. The group of automorphisms is therefore 
the whole diffeomorphism group of spacetime, which, in some sense, comes 
sufficiently close to the group of all permutations El 

2.2 Dynamical symmetries versus covariance 

What is the relation between spacetime automorphisms and symmetries 
of dynamical laws? Before we can answer this, we have to recall what a 
symmetry of a dynamical law is. 

For definiteness, let us restrict attention to dynamical laws in classical 
(i.e. non-quantum) physics. The equations of motion generally take the 
form of systems of differential equations, which we here abbreviate with EM 
(Equation of Motion). These equations involve two types of quantities: 1) 
background structures, collectively abbreviated here by L, and 2) dynamical 
entities, collectively abbreviated here by O. The former will typically be 
represented by geometric objects on M (tensor fields, connections, etc), 
which are taken from a somehow specified set B of 'admissible backgrounds'. 
Typical background structures are external sources, like currents, and the 
geometry of spacetime in non-general-relativistic field theories. Dynamical 
entities typically involve 'particles' and 'fields', which in the simplest cases 
are represented by maps to and from spacetime, 

y : R -> M ('particle') , (la) 
il> : M — > V ('field') , (lb) 

14 This urge for a clean distinction between the axioms and their possible interpretations 
is contained in the famous and amusing dictum, attributed to David Hilbert by his 
student Otto Blumenthal: "One must always be able to say 'tables', 'chairs', and 'beer 
mugs' instead of 'points, 'lines', and 'planes". (German original: "Man mufi jederzeit an 
Stelle von 'Punkten', 'Geraden' und 'Ebenen' 'Tische', 'Stiihle' und 'Bierseidel' sagen 
konnen.") 

15 There are clearly much more general bijections of spacetime than continuous or even 
differentiable ones. However, the diffeomorphism group is still n-point transitive, 
that is, given any two n-tuples of mutually distinct spacetime points, (pi , ■ ■ ■ ,p n ) 
and ( q i , ■ • • ,q n ), there is a diffeomorphism c}) such that for all 1 < i < n we have 
O(pi) = qi; this is true for all positive integers n. 
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and were V is usually some vector space. 

In order to state the equations of motion, one has to first specify a set of 
so-called^lkinematically possible trajectories out of which the dynami- 
cal entities O are taken and solutions to the equations of motion are sought. 
Usually this involves particle trajectories which are sufficiently smooth (typ- 
ically piecewise twice continuously differentiable) and fields which are suf- 
ficiently smooth and in addition have a sufficiently rapid fall-off at large 
spatial distances, so as to give rise to finite quantities of energy, angular- 
momentum, etc. This space of kinematically possible trajectories will be 
denoted by tC. According to the discussion above, the equation of motion 
takes two arguments, one from B the other from /C, and is hence written in 
the form 

EM{I | <D} = 0, (2) 

where the zero on the right-hand side may be a many-component object. 
Equation (J2j) should be read as a selection criterion on the set /C, depending 
on the externally specified values of L. We shall sometimes write EMj for 
EM{ L ■ } to denote the particular equation of motion for O corresponding 
to the choice L for the background structures. In general, the sets of solu- 
tions to ([2]) for variable L are I-dependent subset T>i_ C /C, whose elements 
are called the dynamically possible trajectories^!. We can now say 
more precisely what is usually meant by a symmetry: 

Definition 1 An abstract group G is called a symmetry group of the 

equations of motion ijj^j the following conditions are satisfied: 

1. There is an effective (see below) action G x K, — > K, of G on the set of 
kinematically possible trajectories, denoted by (g,®) i— > g • O. 

2. This action leaves the subset T>^ C K, invariant; that is, for all g in G 
we have: 

EM{ 1 1 <D} = EM{ 1 1 g • O } = . (3) 

Recall that an action is called effective if no group element other than the 
group identity fixes all points of the set it acts on. Effectiveness is required 
in order to prevent mathematically trivial and physically meaningless exten- 
sions of G. What really matters are the orbits of G in /C, that is, the subsets 
O® = {g • O | g 6 G} for each O 6 K,. If the action were not effective, we 
could simply reduce G to a smaller group with an effective action and the 
same orbits in /C, namely the quotient group G/G', where G' is the normal 
subgroup of elements that fix all points of /C. 

It should be noted that this definition is still very general due to the 
fact that no further condition is imposed on the action of G, apart from the 

16 This terminology is due to James Anderson [2] . 

17 Throughout we use "iff" as abbreviation for "if and only if" . 
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obvious one of effectivity. For example, for fields one usually requires the 
action to be 'local', in the sense that for any point p of spacetime, the value 
( 9 ■ MO (p) °f the g-transformed field should be determined by the value of 
the original field at some point p' of spacetime, and possibly finitely many 
derivatives of \|> at p'. If there are no dependencies on the derivatives, the 
action is sometimes called 'ultralocal'. Note that the point p' need not be 
identical to p, but it is assumed to be uniquely determined by g and p. A 
striking example of what can happen if locality is not imposed is given by 
the vacuum Maxwell equations (no external currents), which clearly admit 
the Poincare group as ultralocally acting symmetry group. What is less well 
known is the fact that they also admit the inhomogeneous Galilei group as 
symmetry groupEl, albeit the action is non-local; see [20] or Chap. 5.9 of 
[21j . (There are also other non-local symmetries of the vacuum Maxwell 
equations [19J.) 

To be strictly distinguished from the notion of symmetry is the notion 
of covariance, which we define as follows: 

Definition 2 An abstract group G is called a covariance group of the 

equations of motion iff the following conditions are satisfied: 

1. There is an effective action G x/C — ) fC of G on the set of kinematically 
possible trajectories, denoted by (g, O) i— > g • <D. 

2. There is also an action ( this time not necessarily effective ) G x B — > B 
of G on the set of background structures, likewise denoted by (g,£) •— > 

gl. 

3. The solution-function L i— ) T>i_ C K from B into the subsets of K, is 
G-equivariant. This means the following: If g • T>i_ denotes the set 
{g • <D | ® G T)£\, then, for all g in G, we have 

9-V L = V g . L . (4) 

An alternative way to say this is that the relation that EM establishes 
on B x /C via is G invariant, that is, for all g in G, we have 

EM{ I | <D } = EM{ g • I | g • <D } = . (5) 

The obvious difference between (|3|) and ([5]) is that in the former case the 
background structure is not allowed to change. The transformed dynamical 
entity is required to satisfy the very same equation as the untransformed one, 

18 This is different from, and certainly more surprising than, the better known (ultra local) 
Galilei symmetry of Maxwell's equations in the presence of appropriate constitutive 
relations between the electric field E and the electric displacement-field D on one side, 
and between the magnetic induction-field B and the magnetic field H on the other; see 
e.g. [39] and [27]. 



14 



whereas for a covariance it is only required to satisfy a suitably changed set 
of equations. Here 'changed' refers to the fact that g • L is generally different 
from L. Hence it is clear that a symmetry group is automatically also a 
covariance group, by just letting it act trivially on the set B of background 
structures. The precise partial converse is as follows: Given a covariance 
group G with action on £>, then for each L £ B define the 'stabiliser subgroup' 
of L in G as the set of elements in G that fix L, 

Stab G (I) :={g e G | g •! = !}. (6) 

Then the subgroup Stabc(^) of the covariance group is also a symmetry 
group of the equation of motion EM^. 

The requirement of covariance is a rather trivial one, since it can always 
be met by suitably taking into account all the background structures and 
a sufficiently general action of G on B. To see how this works in a specific 
example, consider the ordinary 'heat equation' for the temperature field T 
(k is a dimensionful constant): 

3 t T - kAT = . (7) 

Let 6 = E3 x R be the 7-parameter group of Euclidean motions (rotations 
and translations in R 3 ) and time translations, whose defining representation 
on spacetime (R 3 x R) is denoted by g — > p g , then G acts effectively on 
the set of temperature fields via g • T := T o p g _! (the inverse being just 
introduced to make this a left action). It is immediate from the structure of 
(J7|) that this implements G as symmetry group of this equation. The back- 
ground structures implicit in ([7]) are: a) a preferred split of spacetime into 
space and time, 2) a preferred measure and orientation of time, and c) a pre- 
ferred distance measure on space. There are many ways to parametrise this 
structure, depending on the level of generality one starts from. If, for exam- 
ple, we start from Special Relativity, we only list those structural elements 
that we need on top of the Minkowski metric {rj^} = diag(l , — 1 , — 1 , — 1 ) in 
order to write down ([7]) . They are given by a single constant and normalised 
timelike vector field n, by means of which we can write ([7]) in the form 

EM{n I T}:=u^T- k(u^u v -r)^)3^T = . (8) 

In the special class of inertial reference frames in which = (1,0,0,0) 
equation (jHJ) reduces to ([7]) . From the structure of ([8]) it is obvious that this 
equation admits the whole Poincare group of Special Relativity as covariance 
group. However, the symmetry group it contains is the stabiliser subgroup 
of the given background structure. The latter is given by the vector field 
n, whose stabiliser subgroup within the Poincare group is just E3 x R, the 
same as for ([TJ. 

Had we started from a higher level of generality, in which no preferred 
coordinate systems are given to us as in Special Relativity, we would write 
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the heat equation in the form 

EM{n, g | T} := #V,T - k(u V - g^V.V J = , (9) 

where now n as well as g feature as background structures, n is again spec- 
ified as unit timelike covariant-constant vector field, g as a flat metric, and 
V as the unique covariant derivative operator associated to g (i.e. torsion 
free and preserving g). Since V is here taken as a unique function of g, 
it does not count as independent background structure. Once again it is 
clear from the structure of (J9j) that the covariance group is now the whole 
diffeomorphism group of spacetime. However, the symmetry group remains 
the same as before since the stabiliser subgroup of the pair (g,u) is E3 x R. 

This example should make clear how easy it is to almost arbitrarily inflate 
covariance groups by starting from higher and higher levels of generality 
and adding the corresponding extra structures into ones list of background 
structures. This possibility is neither surprising nor particularly disturbing. 
Slightly more disturbing is the fact that a similar game can be played with 
symmetries, at least on a very formal level. The basic idea is to simply 
declare background structures to be dynamical ones by letting their values be 
determined by equations. We may do this since we have so far not qualified 
'equations of motion' as any special sort of equations. For example, in the 
special relativistic context we may just take (|8|) and let n be determined by 

n*n v T]^ = h 3^ = 0. (10) 

Then ([8]) and (I10p together define a background free (from the special rela- 
tivistic point of view) system of equations for T, n which has the full Poincare 
group as symmetry group. Its symbolic form is 

EM{0 |T,n} = 0, (11) 

where the on the right-hand side has now 18 components: one for ([8]), one 
for the first equation in (I10p . and 16 (= 4 x 4) for the second equation in 
(|10p . But note that its T-sector of solution space is not the same as that 
of ©, as it now also contains solutions for different n. However, as the 
equations ()10p for u do not involve T, the total solution space for (n, T) can 
be thought of as fibred over the space of allowed n, with each fibre over n 
being given by the solutions T of ([8]) for that given u. Each such fibre is a 
faithful image of the original solution space of ((TJ), suitably transformed by 
a Lorentz boost that relates the original n in ([7]) (i.e. {u^} = (1,0,0,0)) to 
the chosen one. 

Even more radically, we could take (J9|) and declare n and g to be dy- 
namical entities obeying the extra equations 

n^Qjtv = 1 , V^n* = , Riem[g] = , (12) 
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where Riem is the Riemann curvature tensor of g, so that the last equation 
in (fT2|) just expresses flatness of g. The system consisting of © and (fT2|) 
has no background structures and admits the full diffeomorphism group as 
symmetry group. It is of the symbolic form 

EM{ | T, n, g } = , (13) 

which now comprises 36 components: the 16 as above and an additional 
set of 20 for the independent components of Riem. Again, note that the 
T-sector of solution space of ([9]) is now much bigger that of ((ZJ) of or (jSJ). 
With any solution T it also contains its diffeomorphism-transformed one, 
T' = To where <J> G Diff(M). Again, since the equations for n and g 
do not involve T, the total solution space is fibred over the allowed n and 
g fields, with each fibre corresponding to a faithful image of the original 
solution space for (|7|). 

Finally we remark that, in principle, constants appearing in equations 
of motion could also be addressed as background structures whose values 
might eventually be determined by more general dynamical theories. For 
example, one might speculate (as was done some time ago in the so-called 
Brans-Dicke theories) that the gravitational constant is actually the value of 
some field that only in the present epoch of our Universe has settled to a spa- 
tially constant and quasi-static value, but whose value at much earlier times 
was significantly different. Another example from Quantum Field Theory 
concerns the idea that masses of elementary particles are dynamically gen- 
erated by the so-called Higgs field (whose existence is strongly believed but 
not yet experimentally confirmed). 

In any case, the important message from the considerations of this sub- 
section is the following: symmetries emerge or disappear if, respectively, 
background structures become dynamical (L — > O) or dynamical structures 
'freeze' (<D — > I). 

2.3 Observable versus gauge symmetries 

Within the concept of symmetry as explained so far, an important distinc- 
tion must be made between observable symmetries on one hand, and gauge 
symmetries on the other. An observable symmetry transforms a state or 
a history of states (trajectory) into a different, that is, physically distin- 
guishable state or history of states. On the other hand, a gauge symmetry 
transforms a state or a history of states into a physically indistinguishable 
state or a history of states. In this case there is a redundancy in the math- 
ematical description, so that the map from mathematical labels to physical 
states is not faithful. This is usually associated with a group, called the 
group of gauge transformations, denoted by G gau , which acts on the set of 
state labels such that two such labels correspond to the same physical state 
iff they lie in the same orbit of G gau . 
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It is clear that the notion of 'distinguishability' introduced here refers 
to the set of observables, i.e. functions on state space that are physically 
realisable in the widest sense. Assuming for the moment that this was well 
defined, we could attempt a definition as follows: 

Definition 3 Let G be a symmetry group in the sense of Definition^ Then 
g G G is called an observable or physical symmetry iff there exists a 
O G T>i_ and a physical observable that separates g • O from O. If no such 
observable exists, g is called a gauge symmetry. 

It is clear that for a theoretician the stipulation of what functions on state 
space correspond to physically realisable observables is itself of hypothetical 
nature. However, what is important for us at this point is merely that 
relative to such a stipulation the distinction between observables and gauge 
symmetries makes sense. In the mathematical practice gauge symmetries are 
often signalled by an underdeterminedness of the equations of motion, which 
sometimes simply fail to restrict the motion in certain degrees of freedom 
which are then called 'gauge degrees of freedom'. In that case, given any 
solution O G T>£, we can obtain another solution, O', by just changing O in 
those non-determined degrees of freedom in an arbitrary way. For example, 
if the equations of motion are obtained via an action principle, such spurious 
degrees of freedom will typically reveal their nature through the property 
that motions in them are not associated with any action. As a result, the 
equations of motion, which are just the condition for the stationarity of 
the action, will not constrain the motion in these directions. Conversely, 
if according to the action principle the motion in some degree of freedom 
costs action, it can hardly be called a redundant one. In this sense an action 
principle is not merely a device for generating equations of motion, but also 
contains some information about observables. 

The combination of observable and gauge symmetries into the total sym- 
metry group G need not at all be just that of a semi-direct or even direct 
product. Often, in field theory, the gauge group G gau is indeed a subgroup 
of G, in fact an invariant (normal) one, but the observable symmetries, G b S! 
are merely a quotient and not a subgroup of G . In standard group theoretic 
terms one says that G is a G gau — extension of G Q b s - This typically happens 
in electromagnetism or more generally in Yang-Mills type gauge theories or 
General Relativity with globally charged configurations. In this case only 
the 'gauge transformations' with sufficiently rapid fall-off at large spatial 
distances are proper gauge transformations in our sense, whereas the long 
ranging ones cost action if performed in real tim^ and therefore have to 
be interpreted as elements of G Q bsj see e.g. [23] and Chap. 6 of [33]. 

This ends our small excursion into the realm of meanings of 'symmetry'. 
We now turn to the discussion of specific aspects in Pauli's work. 

19 By the very definition of global charge, which is just the derivative of the action with 
respect to a long-ranging 'gauge transformation'. 
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3 Specific comments on symmetries in Pauli's 
work 



The usage of symmetry concepts in Pauli's work is so rich and so diverse that 
it seems absolutely hopeless, and also inappropriate, to try to present them 
in a homogeneous fashion with any claim of completeness. Rather, I will 
comment on various subjectively selected aspects without in any way saying 
that other aspects are of any lesser significance. In fact, I will not include 
some of his most outstanding contributions, like, for example, the formula- 
tion of the exclusion principle, the neutrino hypothesis, or his anticipation 
of Yang-Mills Gauge Theory for the strong interaction. There exist excellent 
reviews and discussions of these topics in the literature. Specifically I wish 
to refer to Bartel van der Waerden's contribution Exclusion Principle and 
Spin to the Pauli Memorial Volume (|18j. pp. 199-244), Norbert Straumann's 
recent lecture on the history of the exclusion principle [68], Pauli's own ac- 
count of the history of the neutrino (in English: [57j . pp. 193-217; in German: 
[38], Vol. 2, pp. 1313-1337 and [55], p. 156-180), Chien-Shiung Wu's account 
The Neutrino in the Pauli Memorial Volume (|18|. pp. 249-303), and the his- 
torical account of gauge theories by Lochlainn O'Raifeartaigh and Norbert 
Straumann [50] . A non-technical overview concerning Pauli 's Belief in Ex- 
act Symmetries is given by Karl von Meyenn [IB]. Last, but clearly not 
least, I wish to mention Charles Enz's fairly recent comprehensive scientific 
biography [16] of Wolfgang Pauli, which gives a detailed discussion of his 
scientific cevre. 

In this contribution I rather wish to concentrate on some particular as- 
pects of the notion of symmetry that are directly related to the foregoing 
discussion in Sections l2.2l and l2.3l as I feel that they are somewhat neglected 
in the standard discussions of symmetry. 

3.1 The hydrogen atom in matrix mechanics 

In January 1926 Pauli managed to deduce the energy spectrum for the Hy- 
drogen atom from the rules of matrix mechanics. For this he implicitly 
used the fact that the mechanical problem of a point charge moving in a 
spherically symmetric force-field with a fall-off proportional to the square 
of the inverse distance has a symmetry group twice as large (i.e. of twice 
the dimension) as the group of spatial rotations alone, which it contains. 
Hence the total symmetry group is made half of a 'kinematical' part, refer- 
ring to space, and half of a 'dynamical' part, referring to the specific force 
law (1/r 2 fall-off). Their combination is a proper physical symmetry group 
that transforms physically distinguishable states into each other. In the 
given quantum-mechanical context one also speaks of 'spectrum generating' 
symmetries. 

Let us recall the classical problem in order to convey some idea where the 
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symmetries and their associated conserved quantities show up, and how they 
may be employed to solve the dynamical problem. Consider a mass-point of 
mass m and position coordinate fin the force field F(f) = — (K/r 2 )ft, where 
r is the length of f, ft := f/r, and K is some dimensionful constant. Then, 
according to Newton's 3rd law (an overdot stands for the time derivative), 

r = -^rt (lc = K/m). (14) 

Next to energy, there are three obvious conserved quantities correspond- 
ing to the three components of the angular-momentum vector (here written 
per unit mass) 

I=fxf. (15) 

But there are three more conserved quantities, corresponding to the compo- 
nents of the following vector (today called the Lenz-Runge vector), 

e = k" 1 fx?-u. (16) 

Conservation can be easily verified by differentiation of (|16p using (|14p and 
ft = i X ft/r . Hence on has (i = length of i) 

£-r = 0, f-e = 0, r + f • e - YrH 2 = , (17) 

from which the classical orbit immediately follows: Setting f • e = re cos cp, 
the last equation (fTTl) reads 

- (18) 



1 + e cos cp 

which is the well known equation for a conic section in the plane perpendic- 
ular to I, focus at the origin, eccentricity e (= length of e), and latus rectum 
2£ 2 /k. The vector e points from the origin to the point of closest approach 
(periapsis). The few steps leading to this conclusion illustrate the power 
behind the method of working with conservation laws which, in turn, rests 
on an effective exploitment of symmetries. 

The total energy per unit mass is given by F_ = jf 2 — k/r. A simple 
calculation shows that 

e 2 -l = 2E£ 2 /k 2 , (19) 

which allows to express E as function of the invariants e 2 and I 2 . This is the 
relation which Pauli shows to have an appropriate matrix analogue, where 
it allows to express the energy in terms of the eigenvalues of the matrices for 
I and e which Pauli determines, leading straight to the Balmer formula. 

From a modern point of view one would say that, for fixed energy E < 0@ 
the state space of this problem carries a Hamiltonian action of the Lie algebra 



20 For E > one obtains a Hamiltonian action of so (1 ,3) 
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50 (4), generated by the 3+3 quantities I and e. Quantisation then consists 
in the problem to represent this Lie algebra as a commutator algebra of 
self- adjoint operators and the determination of spectra of certain elements 
in the enveloping algebra. This is what Pauli did, from a modern point of 
view, but clearly did not realise at the time. In particular, even though he 
calculated the commutation relations for the six quantities i and e, he did 
not realise that they formed the Lie algebra for 50 (4), as he frankly stated 
much later (1955) in his address on the occasion of Hermann Weyl's 70th 
birthday^ 

Similarly I did not know that the matrices which I had derived 
from the new quantum mechanics in order to calculate the en- 
ergy values of the hydrogen atom were a representation of the 
^-dimensional orthogonal group. 

This may be seen as evidence for Pauli's superior instinct for detecting rel- 
evant mathematical structures in physics. Much later, in a CERN-report of 
1956, Pauli returned to the representation-theoretic side of this problem |53j . 

3.2 Particles as representations of spacetime automorphisms 

The first big impact of group theory proper on physics took place in quan- 
tum theory, notably through the work or Eugene Wigner [72] and Hermann 
Weyl [70J. While in atomic spectroscopy the usage of group theory could 
be looked upon merely as powerful mathematical tool, it definitely acquired 
a more fundamental flavour in (quantum) field theory. According to a dic- 
tum usually attributed to Wigner, every elementary system (particle) in 
special-relativistic quantum theory corresponds to a unitary irreducible rep- 
resentation of the Poincare group o In fact, all the Poincare invariant linear 
wave equations on which special-relativistic quantum theory is based, known 
by the names of Klein & Gordon, Weyl, Dirac, Maxwell, Proca, Rarita & 
Schwinger, Bargmann & Wigner, Pauli & Fierz, can be understood as projec- 
tion conditions that isolate an irreducible sub-representation of the Poincare 
grourj^l within a reducible one that is easy to write down. More concretely, 
the latter is usually obtained as follows: Take a field r[> on spacetime M with 

21 German original: "Ebensowenig wufite ich, dafi die Matrices, die ich ausgerechnet hatte, 
um die Energiewerte des Wasserstoffatoms aus der neuen Quantenmechanik abzuleiten, 
eine Darstellung der 4-dimensionalen othogonalen Gruppe gewesen sind". ( |45| . Vol. IV, 
Part III, Doc. [2183], p. 402) Note that, in modern terminology, Pauli actually refers to 
a representation of the Lie algebra of the orthogonal group. 

22 The converse is not true, since there exist unitary irreducible representations which 
cannot correspond to (real) particles, for example the so-called 'tachyonic' ones, corre- 
sponding to spacelike four-momenta. 

23 More precisely, its universal cover R x SL(2,C), or sometimes an extension thereof by 
the discrete transformations of space and time reversal. 
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values in a finite-dimensional complex vector space V. Let D be a finite- 
dimensional irreducible representation of the (double cover of the) Lorentz 
group SL(2,C) on vE^I It is uniquely labelled by a pair (p, q) of two positive 
integer- or half- integer- valued numbers. In the standard terminology, 2p 
corresponds to the number of unprimed, 2q to the number of primed spinor 
indices of it>. The set of such fields furnishes a linear representation of the 
(double cover of the) Poincare group, R 4 x SL(2,C), where the action of the 
group element g = (a, A) is given by 

g-il-^D^^og- 1 ), (20) 

or for the Fourier transform ij>, 

g-^exptip^MAM^oA- 1 ). (21) 

One immediately infers from (|2ip that irreducibility implies that i|) must 
have support on a single SL(2,C) orbit in momentum space. Here one usually 
restricts to those orbits consisting of non-spacelike p (those with spacelike 
p give rise to the tachyonic representations which are deemed unphysical), 
which are labelled by p^p^ = m 2 with non-negative m. For il> this means 
that it obeys the Klein-Gordon equation (□ + m 2 )if> = 0. This is already 
half the way to an irreducible representation, insofar as it now contains only 
modes of fixed mass. But these modes still contains several spins up to the 
maximal value p + q. A second and last step then consists of projecting 
out one (usually the highest) spin, which gives rise to the equations named 
above. In this fashion the physical meanings of mass and spin merge with 
the abstract mathematical meaning of mere labels of irreducible representa- 
tions. Mass and spin are the most elementary attributes of physical objects, 
so that objects with no other attributes are therefore considered elementary. 
As just described, these elementary attributes derive from the representa- 
tion theory of a group whose significance is usually taken to be that it is 
the automorphism group of spacetime. However, as already discussed in 
SectionsQ] and 12.11 this point of view presupposes a hierarchy of physical 
thinking in which spacetime (here Minkowski space) is considered an entity 
prior to (i.e. more fundamental than) matter, which may well be challenged. 
A more consistent but also more abstract point of view would be to think 
of the abstractly Poincare group as prior to the matter content as well as 
the spacetime structure and to derive both simultaneously. Here 'deriving' 

24 The representation D is never unitary, simply because the Lorentz group has no non- 
trivial finite-dimensional unitary irreducible representations. But it will give rise to an 
infinite-dimensional representation on the linear space of fields i|> which will indeed be 
unitary. 

25 'Abstract' here means to consider the isomorphicity class of the group as mathematical 
structure, without any interpretation in terms of transformations of an underlying set 
of objects. 
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a spacetime structure (geometry) from a group would be meant in the sense 
of Klein's Erlanger Programm |36j . 

We have already discussed in Section[T] Pauli 's shift in emphasis towards 
a more abstract point of view as regards spacetime structure. But also as 
regards to matter he was, next to Wigner, one of the proponents to put sym- 
metry considerations first and to derive the wave equations of fundamental 
fields as outlined above. Based on previous work by Fierz on the theory of 
free wave equations for higher spin [T7] , Fierz and Pauli published their very 
influential paper On Relativistic Equations for Particles of Arbitrary Spin 
in an Electromagnetic Field (|38j, Vol.2, pp. 873-894) which is still much 
cited today. 

In fact, much earlier, in his 1927 paper Quantum Mechanics of the 
Magnetic Electrorfi^, Pauli succeeded to implement the electron's spin into 
non-special-relativistic quantum mechanics in an entirely representation- 
theoretic fashion as regards the (Lie algebra of) spatial rotations. In contrast 
to the other (translational) degrees or freedom, spin does not appear as the 
quantisation of an already existent classical degree of freedom. This must 
have appeared particularly appealing to Pauli, who never wanted the elec- 
tron's 'spin' to be understood as an intrinsic angular momentum due to a 
spatial rotation of a material structure. When Pauli introduced the new spin 
quantum-number for the electron in his 1924 paper On the Influence of the 
Velocity Dependence of the Electron Mass on the Zeeman EffecW\ he delib- 
erately stayed away from any model interpretation and cautiously referred 
to it as a peculiar, classically indescribable disposition of two-valuedness of 
the quantum-theoretic properties of the light- electror¥^\. At that time an 
understandable general scepticism against possible erroneous prejudices im- 
posed by the usage of classical models had already firmly established itself 
in Pauli's (and others) thinking. 

As much justified as this is in view of Quantum Mechanics, this had also 
led to overstatements to the effect that spin has no classical counterpart 
and that any classical model is even classically contradictory in the sense 
of violating Special Relativity. As regards the second point, which was also 
pushed by Pauli, we refer to the detailed discussion in [22J. To the first point 
we first wish to mention that composite models with half-integer angular 
momentum states exist in ordinary Quantum Mechanics (without spin), as, 
e.g., pointed out by Bopp & Haag in 1950 [§J. This is possible if their classical 
configuration space contains the whole group SO (3) of spatial rotations. 

26 German original: "Zur Quantenmechanik des magnetischen Elektrons". ([38], Vol.2, 
pp. 306-330) 

27 German original: "Uber den Einflufi der Geschwindigkeitsabhangigkeit der Elektronen- 
masse auf den Zeemaneffekt" . ([3H], Vol.2, pp. 201-213) 

28 German original: "eine eigentiimliche, klassisch nicht beschreibbare Art von Zwei- 
deutigkeit der quantentheoretischen Eigenschaften des Leuchtelektrons" f [38] . Vol.2, 
p. 213). 
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Pauli himself showed in his 1939 paper On a Criterion for Single- or Double- 
Valuedness of the Eigenf unctions in Wave Mechanics^ the possibility of 
double- valued wavefunctions, which are the ones that give rise to half-integer 
angular momentum states. Moreover, in classical mechanics there is also a 
precise analog of Wigner's notion of an elementary system. Recall that 
the space of states of a mechanical system is a symplectic manifold (phase 
space). The analog of an irreducible and unitary representation of the group 
of spacetime automorphisms is now a transitive and Hamiltonian action of 
this group on the symplectic manifold. It is interesting to note that this 
classical notion of an elementary system was only formulated much later 
than, and in the closest possible analogy with, the quantum mechanical 
one. An early reference where this is spelled out is The classification 
of elementary systems is now equivalent to the classification of symplectic 
manifolds admitting such an action. An early reference where this has been 
done is [3]. Here, as expected, an intrinsic angular momentum shows up 
as naturally as it does in Quantum Mechanics. What makes it slightly 
unusual (but by no means awkward or even inconsistent) is the fact that it 
corresponds to a phase spac^l that is not the cotangent bundle (space of 
momenta) over some configuration space of positions. 

Pauli's later writings also show this strong inclination to set the funda- 
mentals of (quantum) field theory in group-theoretic terms. In his survey 
Relativistic Field Theories of Elementary Particles ([38], Vol.2, pp. 923- 
952), written for the 1939 Solvay Congress, Pauli immediately starts a dis- 
cussion of "transformation properties of the field equations and conservation 
laws". His posthumously published notes on Continuous Groups in Quan- 
tum Mechanics [53] focus exclusively on Lie-algebra methods in representa- 
tion theory. 

Today we are used to define physical quantities like energy, momentum, 
and angular momentum as the conserved quantities associated to spacetime 
automorphisms via Noether's theorem. Here, too, Pauli was definitely an 
early advocate of this way of thinking. Reviews on the subject written 
shortly after Pauli's death show clear traces of Pauli's approach; see e.g. |35j . 

3.3 Spin and statistics 

Pauli's proof of the spin-statistics correlation [HT] (also [HH], Vol.2, pp. 911- 
922), first shown by Markus Fierz in his habilitation thesis |17| . is a truly 
impressive example for the force of abstract symmetry principles. Here we 
wish to recall the basic lemmas on which it rests, which merely have to do 
with classical fields and representation theory. 

29 German original: "Uber ein Kriterium fur Ein- oder Zweiwertigkeit der Eigenfunktionen 
in der Wellenmechanik" . ([38], Vol.2, pp. 847-868) 

30 The phase space for classical spin is a 2-sphere, which is compact and therefore leads 
to a finite-dimensional Hilbert space upon quantisation. 
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We begin by replacing the proper orthochronous Lorentz group by its 
double (= universal) cover SL(2,C) in order to include half-integer spin 
fields. We stress that everything that follows merely requires the invariance 
under this group. No requirements concerning invariance under space- or 
time reversal are needed! 

We recall from the previous section that any finite-dimensional complex 
representation of SL(2,C) is labelled by an ordered pair (p, q), where p and q 
may assume independently all non- negative integer or half-integer values. 2p 
and 2q correspond to the numbers of 'unprimed' and 'primed' spinor indices, 
respectively. The tensor product of two such representations decomposes as 
follows: 

p+p' q+q' 

D (P,q) 0D (p',q') = D (r,s) ( (22) 

T =\p—p'\ s=|q — q'| 

where — and this is the important point in what follows — the sums proceed 
in integer steps in r and s. With each D' p,q ' let us associate a 'Pauli Index', 
given by 

7i : D (p - q) -» ((-l) 2p , (-l) 2q ) € Z 2 xZ 2 . (23) 

This association may be extended to sums of such D' p ' q ' proceeding in 
integer steps, simply by assigning to the sum the Pauli Index of its terms 
(which are all the same). Then we havd^l 

7T(D (p ' q) ® D (p '' q,) ) = 7T(D (p - q) ) • 7t(D (p '- q,) ) . (24) 

According to their representations, we can associate a Pauli Index with 
spinors and tensors. For example, a tensor of odd/even degree has Pauli 
Index (—,—)/(+,+). The partial derivative, 3, counts as a tensor of degree 
one. Now consider the most general linear (non interacting) field equations 
for integer spin (here and in what follows ?"(•••) simply stands for "sum of 
terms of the general form (• • • )"): 

L»(-H*(-h + ) = !>(-.-). 

Z 9 (->-^(-,-) = L^(+,+)- 

These are invariant under 

(26) 

Next consider any current that is a polynomial in the fields and their deriva- 
tives: 

J(-H = T_ V(-H + + 3(-,-^(+,+) (27) 

+¥ ( _ H 3 ( _ i _ ) ¥ ( _,_ ) + • • • 




31 This may be expressed by saying that the map n is a homomorphism of semigroups. One 
semigroup consists of direct sums of irreducible representations proceeding in integer 
steps with operation ®, the other is Z2 x Z2, which is actually a group. 
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Then one has 



(®J)(x)=-J(-x) 



(28) 



This shows that for any solution of the field equations with charge Q for 
the conserved current J (Q being the space integral over J°) there is another 
solution (the transformed) with charge — Q. It follows that charges of 
conserved currents cannot be sign-definite in any SL(2,C)-invariant theory 
of non- interacting integer spin fields. In the same fashion one shows that 
conserved quantities, stemming from divergenceless symmetric tensors of 
rank two, bilinear in fields, cannot be sign-definite in any SL(2,C) invariant 
theory of non-interacting half-integer spin fields. In particular, the conserved 
quantity in question could be energy! 

An immediate but far reaching first conclusion (not explicitly drawn by 
Pauli) is that there cannot exist a relativistic generalisation of Schrodinger's 
one- particle wave equation. For example, for integer-spin particles, one 
simply cannot construct a non-negative spatial probability distribution de- 
rived from conserved four-currents. This provides a general argument for 
the need of second quantisation, which in textbooks is usually restricted to 
the spin-zero case. 

Upon second quantisation the celebrated spin-statistics connection for 
free fields can now be derived in a few lines. It says that integer spin fields 
cannot be quantised using anti-commutators and half-integer spin field can- 
not be quantised using commutators. Here the so-called Jordan-Pauli dis- 
tribution plays a crucial rol^fl in the (anti)commutation relations, which 
ensures causality (observables localised in spacelike separated regions com- 
mute). Also, the crucial hypothesis of the existence of an SL(2, C) invariant 
stable vacuum state is adopted. Pauli ends his paper by saying: 

In conclusion we wish to state, that according to our opinion the 
connection between spin and statistics is one of the most impor- 
tant applications of the special relativity theory. ([51] : p. 722) 

It took almost 20 years before first attempts were made to generalise this 
result to the physically relevant case of interacting fields by Luders & Zu- 
mino [12]. 

32 The Jordan-Pauli distribution was introduced by Jordan and Pauli in their 1927 paper 
Quantum Electrodynamics of Uncharged Fields ("Zur Quantenelektrodynamik ladungs- 
freier Felder"; [38], Vol.2, pp. 331-353) in an attempt to formulate Quantum Electro- 
dynamics in a manifest Poincare invariant fashion. It is uniquely characterised (up to a 
constant factor) by the following requirements: (1) it must be Poincare invariant under 
simultaneous transformations of both arguments; (2) it vanishes for spacelike separated 
arguments; (3) it satisfies the Klein-Gordon equation. The (anti)commutators of the 
free fields must be proportional to the Jordan-Pauli distribution, or to finitely many 
derivatives of it. 
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3.4 The meaning of 'general covariance' 

General covariance is usually presented as the characteristic feature of Gen- 
eral Relativity. The attempted meaning is that a generally covariant law 
takes the 'same form' in all spacetime coordinate systems. However, in or- 
der to define the 'form' of a law one needs to make precisely the distinction 
between background entities, which are constitutive elements of the law, and 
the dynamical quantities which are to be obey the laws so defined (cf. Sec- 
tion l2.ip . In the language we introduced above, 'general covariance' cannot 
just mean simple covariance under all smooth and invertible transforma- 
tions of spacetime points, i.e. that the spacetime diffeomorphism group is a 
covariance group is the sense of Definition[2j for that would be easily achiev- 
able without putting any restriction on the intended law proper, as was al- 
ready pointed out by Erich Kretschmann in 1917 [37J. Einstein agreed with 
that criticism of Kretschmann's, which he called "acute" (German original: 
"scharfsinnig" ) ( [BS] . Vol. 7, Doc. 4, pp. 38-41), and withdrew to the view that 
the principle of general covariance has at least some heuristic power in the 
following sense! 33 ! 

Between two theoretical systems which are compatible with expe- 
rience, that one is to be preferred which is the simpler and more 
transparent one from the standpoint of the absolute differential 
calculus. Try to bring Newton's gravitational mechanics in the 
form of generally covariant equations (four dimensional) and one 
will surely be convinced that principle al 3 ^! is, if not theoretically, 
but practically excluded. 

But the principle of general covariance is intended as a non-trivial selec- 
tion criterion. Hence modern writers often characterise it as the requirement 
of diffeomorphism invariance, i.e. that the diffeomorphism group of space- 
time is a symmetry group in the sense of Definition[TJ But then, as we 
have seen above, the principle is open to trivialisations if one allows back- 
ground structures to become formally dynamical. This possibility can only 
be inhibited if one limits the amount of structure that may be added to the 

33 German original: "Von zwei mit der Erfahrung vereinbaren theoretischen Systemen 
wird dasjenige zu bevorzugen sein, welches vom Standpunkte des absoluten Differen- 
tialkalkiils das einfachere und durchsichtigere ist. Man bringe einmal die Newtonsche 
Gravitationsmechanik in die Form von kovarianten Gleichungen (vierdimensional) und 
man wird sicherlich iiberzeugt sein, dafi das Prinzip a) diese Theorie zwar nicht theo- 
retisch, aber praktisch ausschliefit." ([66], Vol.7, Doc. 4, p. 39) 

34 Einstein formulates principle a) thus: "Principle of relativity: The laws of nature exclu- 
sively contain statements about spacetime coincidences; therefore they find their natural 
expression in generally covariant equations." ([66], Vol. 7, Doc. 4, p. 38) 
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dynamical fields Q 

The reason why I mention all this here is that Pauli's Relativity article is, 
to my knowledge, the only one that seems to address that point, albeit not 
as explicitly as one might wish. After mentioning Kretschmann's objection, 
he remarks (the emphases are Pauli's) @ 

The generally covariant formulation of the physical laws acquires 
a physical content only through the principle of equivalence, in 
consequence of which gravitation is described solely by the 
and the latter are not given independently from matter, but are 
themselves determined by the field equations. Only for this reason 
can the gtic be described as physical quantities. (|54|. p. 150) 

Note how perceptive Pauli addresses the two central issues: 1) that one 
has to limit the the amount of dynamical variables and 2) that dynamical 
structures have to legitimate themselves as physical quantities through their 
back reaction onto other (matter) structures. It is by far the best few- 
line account of the issue that I know of, though perhaps a little hard to 
understand without the more detailed discussion given above in Section l2~2l 
Most modern textbooks do not even address the problem. See [25] for more 
discussion. 

3.5 General covariance and antimatter 

In this section I wish to give a brief but illustrative example from Pauli's 
work for the non-trivial distinction between observable physical symmetries 
on one hand, and gauge symmetries on the other (cf. Section [2T3|) . The 
example I have chosen concerns an argument within the (now outdated) 
attempts to understand elementary particles as regular solutions of classical 
field equations. Pauli reviewed such attempts in a rather detailed fashion in 
his Relativity article, with particular emphasis on Weyl's theory, to which 
he had actively contributed in two of his first three published papers in 1919. 

The argument proper says that in any 'generally covariant f^l theory, 
which allows for regular static solutions representing charged particles, there 

35 Physically speaking, one may be tempted to just disallow such formal 'equations of 
motions' whose solution space is (up to gauge equivalence) zero dimensional. But this 
would mean that one would have to first understand the solution space of a given theory 
before one can decide on its 'general covariance' properties, which would presumably 
render it a practically fairly useless criterion. 

36 German original: Einen physikalischen Inhalt bekommt die allgemein kovariante For- 
mulierung der Naturgesetze erst durch das Aquivalenzprinzip, welches zur Folge hat, dafi 
die Gravitation durch die gtk allein beschrieben wird, und dafi diese nicht unabhangig 
von der Materie gegeben, sondern selbst durch Feldgleichungen bestimmt sind. Erst de- 
shalb konnen die gn< als physikalische Zustandsgrdfien bezeichnet werden. ([58], p. 181) 

37 Here 'general covariance' is taken to mean that the diffeomorphism group of spacetime 
acts as symmetry group. 
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exists for any solution with mass m and charge e another such solution with 
the same mass but opposite charge — e. Pauli's proof looks like an almost 
trivial application of general covariance and runs as follows: Let g,xv(x A ) and 
A^(x A ) represent the gravitational and electromagnetic field respectively. 
The hypothesis of staticity implies that coordinates (and gauges for A^) 
can be chosen such that all fields are independent of the time coordinate, 
x°, and that got = as well as A^ = for i = 1,2,30 Now consider 
the orientation-reversing diffeomorphism ct) : ( x °, t— > (— x°,x). It maps 
the gravitational field to itself while reversing the sign of Ao and hence of 
the electric field. General covariance assures these new fields to be again 
solutions with the same total mass but opposite total electric charge. 

Pauli presents this argument in his second paper addressing Weyl's 
theory, entitled To the Theory of Gravitation and Electricity by Hermann 
Weykz] ([31], Vol. 2, pp. 13-23, here p. 18) and also towards the end of Sec- 
tion 67 of his Relativity article. The idea of this proof is due to Weyl who 
communicated it (without formulae) in his first two letters to Pauli ( [_45_|. 
Vol. 1, Doc. [1] and [2]), as Pauli also acknowledges in his paper ([38], Vol. 2, 
p. 18, footnote 2). 

It is interesting to note that Einstein rediscovered the very same argu- 
ment in 1925 and found it worthy of a separate communication [13J. At 
the time it was common to all, Weyl, Pauli, and Einstein, to regard the ar- 
gument a nuisance and of essentially destructive nature. This was because 
at this time antiparticles had not yet been discovered so that the apparent 
asymmetry as regards the sign of the electric charges of fundamental par- 
ticles was believed to be a fundamental property of Nature. Already in his 
first paper on Weyl's theory ([38], Vol. 2, pp. 1-9), entitled Perihelion Motion 
of Mercury and Deflection of Rays in Weyl 's Theory of GravitatioT^l. Pauli 
emphasised 

The main difficulty [with Weyl's theory] is - apart from Ein- 
stein's objection, which appears to me not yet sufficiently dis- 
proved - that the theory cannot account for the asymmetry be- 
tween the two sorts of electricity. 

Now, there is an interesting conceptual point hidden in this argument 
that relates to our discussions in Sections l2.2l and 12.31 First of all, the two 

38 The latter conditions distinguish staticity from mere stationarity. The condition on At 
may, in fact, be relaxed. 

39 German original: "Zur Theorie der Gravitation und der Elektrizitat von Hermann 
Weyl". 

40 German original: "Merkurperihelbewegung und Strahlenablenkung in Weyls Gravita- 
tionstheorie" . 

41 German original: "Die Hauptschwierigkeit ist - neben Einstein's Einwand, der mir dur- 
chaus noch nicht hinreichend widerlegt scheint -, dafi die Theorie von der Asymmetrie 
der beiden Elektrizitatsarten nicht befriedigend Rechenschaft zu geben vermag." ([38], 
Vol. 2, p. 8) 
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solutions are clearly considered physically distinct, otherwise the argument 
could not be understood as contradicting the charge asymmetry in Nature. 
Hence the diffeomorphism involved cannot be considered a gauge trans- 
formation but rather corresponds to a proper physical symmetry. On the 
other hand, we know that diffeomorphisms within bounded regions must be 
considered as gauge transformations, for otherwise one would run into the 
dilemma set by the so-called "hole argument^ Hence one faces the prob- 
lem of how one should characterise those diffeomorphisms which are not to 
be considered as gauge transformations (cf. Section lXB"]) . It is conceivable 
that this question is not decidable without contextual information. (See e.g. 
|23j and Chapter 6 of [33J for more discussion of this point.) The historical 
sources have almost nothing to say about this, though there are suggestions 
by all three mentioned authors how to circumvent the argument by adding 
more non-dynamical structures, as a result of which general covariance is 
lost. Einstein, being most explicit here, suggested the existence of a time- 
like vector field which fixes a time orientation. At least the so-defined time 
orientation would then have to be considered as non-dynamical structure 
of type L (cf. Section [T2j) in order to break the symmetry group down 
to the stabiliser group of L. The time-orientation-reversing transformation 
used above would then not be a symmetry anymore. Similar suggestions 
were made by Weyl, who also hinted at a structure to distinguish past and 
future@ 

Their essential difference [of past and future] I take, contrary to 
most physicists, to be a fact of much more fundamental meaning 
than the essential difference between positive and negative charge. 

In the last (5th) edition of Raum Zeit Materie, Hermann Weyl writes re- 

42 Let Q be a bounded region in spacetime which is disjoint from a spacelike hypersurface 
£. Consider two solutions to the field equations which merely differ by the action of a 
diffeomorphism 4> with support in CI. If they are considered distinct, then the theory 
cannot have a well posed initial-value problem, since then for any L distinct solutions 
exist with identical data on L. This is a rephrasing of Einstein's original argument ( [66] . 
Vol.4, Doc. 25, p. 574, Doc. 26, p580, Vol.6, Doc. 2, p. 10), which did not construct a 
contradiction to the existence of a well posed initial-value problem, but rather to the 
requirement that the gravitational field be determined by the matter content (more 
precisely: its energy momentum tensor). But this requirement is clearly never fulfilled 
in any generally covariant theory in which the gravitational field has its own degrees 
of freedom, independent of whether one regards diffeomorphisms as gauge. Slightly 
later he rephrased it so as to construct a contradiction to the existence of a well posed 
boundary-value problem ( 66 , Vol.6, Doc. 9, p. 110), which is also not the right thing 
to require from equations that describe the propagation of fields with own degrees of 
freedom. 

43 German original: "Ihren Wesensunterschied [von Vergangenheit und Zukunft] halte ich, 
im Gegensatz zu den meisten Physikern, fur eine Tatsache von noch viel fundament alerer 
Bedeutung als der Wesensunterschied zwischen positiver und negativer Elektrizitat." 
([45], Vol.1, Doc. [2], p. 6) 
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garding his unified theory (the emphases are Weyl's): @ 

The theory gives no clue as regards the disparity of positive 
and negative electricity. But that cannot be taken as a reproach 
against the theory. For that disparity is based without doubt on 
the fact that of both fundamental constituents of matter, the elec- 
tron and the hydrogen nucleus, the positively charged one is tight 
to another mass then the negatively charged one; it originates 
from the nature of matter and not of the field. 

Given that Weyl is talking about his unified field-theory of gravity and 
electricity, whose original claim was to explain all of matter by means of 
field theory, this statement seems rather surprising. It may be taken as a 
sign of Weyl's beginning retreat from his once so ambitious programme. 

3.6 Missed opportunities 
3.6.1 Supersymmetry 

One issue that attracted much attention during the 1960s was, whether the 
observed particle multiplets could be understood on the basis of an all em- 
bracing symmetry principle that would combine the Poincare group with 
the internal symmetry groups displayed by the multiplet structures. This 
combination of groups should be non-trivial, i.e., not be a direct product, for 
otherwise the internal symmetries would commute with the spacetime sym- 
metries and lead to multiplets degenerate in mass and spin (see, e.g., [49J ) . 
Subsequently, a number of no-go theorems appeared, which culminated in 
the now most famous theorem of Coleman & Mandula [llj . This theorem 
states that those generators of symmetries of the S-matrix belonging to the 
Poincare group necessarily commute with those belonging to internal sym- 
metries. The theorem is based on a series of assumption^! involving the 
crucial technical condition that the S-matrix depends analytically on stan- 
dard scattering parameters. What is less visible here is that the structure of 
the Poincare group enters in a decisive way. This result would not follow for 

44 German original: "Die Theorie gibt keinen Aufschlufi uber die Ungleichartigkeit von 
positiver und negativer Elektrizitat. Das kann ihr aber nicht zum Vorwurf gemacht 
werden. Denn jene Ungleichartigkeit beruht ohne Zweifel darauf, daB von den beiden 
Urbestandteilender der Materie, Elektron und Wasserstoffkern, der positiv geladene mit 
einer anderen Masse verbunden ist als der negaiv geladene; sie entspringt aus der Natur 
der Materie und nicht des Feldes." ([71], p. 308) 

45 The assumptions are: (1) there exists a non-trivial (i.e., 7^ 1) S-matrix which depends 
analytically on s (the squared centre-of-mass energy) and t (the squared momentum 
transfer); (2) the mass spectrum of one-particle states consists of (possibly infinite) 
isolated points with only finite degeneracies; (3) the generators (of the Lie algebra) of 
symmetries of the S-matrix contains (as a Lie-sub algebra) the Poincare generators; 
(4) some technical assumptions concerning the possibility of writing the symmetry gen- 
erators as integral operators in momentum space. 
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the Galilean group, as was explicitly pointed out by Coleman & Mandula 

am, p. 159). 

One way to avoid the theorem of Coleman & Mandula is to generalise 
the notion of symmetries. An early attempt was made by Golfand & Likht- 
man [29j , who constructed what is now known as a Super-Lie algebra, which 
generalises the concept of Lie algebra (i.e. symmetry generators obeying cer- 
tain commutation relations) to one also involving anti-commutators. In this 
way it became possible for the first time to link particles of integer and half- 
integer spin by a symmetry principle. It is true that Supersymmetry still 
maintains the degeneracy in masses and hence cannot account for the mass 
differences in multiplets. But its most convincing property, the symmetry 
between bosons and fermions, suggested a most elegant resolution of the 
notorious ultraviolet divergences that beset Quantum Field Theory. 

It is remarkable that the idea of a cancellation of bosonic and fermionic 
contributions to the vacuum energy density occurred to Pauli. In his lectures 
Selected Topics in Field Quantization, delivered in 1950-51 (in print again 
since 2000, [59]), he posed the question 

..whether these zero-point energies [from Bosons and Fermions] 
can compensate each other. ([59], p. 33) 

He tried to answer this question by writing down the formal expression for 
the zero point energy density of a quantum field of spin ) and mass irij > 
(Pauli restricted attention to spin and spin 1 /2, but the generalisation is 
immediate) : 

47t 2 ^ = (-i)3(2j + r 



dkkVk 2 + m 2 . (29) 



Cancellation should take place for high values of k. The expansion 

rK 

4 dkkVk 2 + m 2 = K 4 + m 2 K 2 - m 4 log(2K/mj) + 0(K _1 ) (30) 
Jo 

shows that the quartic, quadratic, and logarithmic terms must cancel in the 
sum over ) for the limit K — > oo to exist. This implies that for n = 0,2,4 
one must have 

}j-1) 2 '(2j + 1)m^ = and Jj-l ) 2 ' (2j + 1 ) log(mj) = . (31) 
) ) 

Pauli comments that 

these requirements are so extensive that it is rather improbable 
that they are satisfied in reality. ([59]. p. 33) 

Unless enforced by an underlying symmetry, one is tempted to add! This 
would have been the first call for a supersymmetry in the year 1951. 
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However, the real world does not seem to be as simple as that. Super- 
symmetry, if at all existent, is strongly broken in the phase we live in. So far 
no supersymmetric partner of any existing particle has been detected, even 
though some of them (e.g., the neutralino) are currently suggested to be vi- 
able candidates for the missing-mass problem in cosmology. Future findings 
(or non-findings) at the Large Hadron Collider (LHC) will probably have a 
decisive impact on the future of the idea of super symmetry, which — whether 
or not it is realised in Nature — is certainly very attractive; and Pauli came 
close to it. 

3.6.2 Kaluza-Klein Monopoles 

Ever since its first formulation in 1921, Pauli as well as Einstein were much 
attracted by the geometric idea of Theodor Kaluza and its refinement by 
Oskar Klein, according to which the classical theories of the gravitational 
and the electromagnetic field could be unified into a single theory, in which 
the unified field has the same meaning as Einsteins gravitational field in 
General Relativity, namely as metric tensor of spacetime, but now in five 
instead of four dimensions. The momentum of a particle in the additional 
fifth direction (which is spacelike) is now to be interpreted as its charge. 
Charge is conserved because the geometry of spacetime is a priori restricted 
to be independent of that fifth direction. The combined field equations 
are exactly the five-dimensional analog of Einstein's equations for General 
Relativity. 

A natural question to address in this unified classical theory was whether 
it admits solutions that could represent particle- like objects. More precisely, 
the solution should be stationary, everywhere regular, and possess long- 
ranging gravitational and electromagnetic fields (usually associated with 
aspects of mass and charge). Pauli, who was very well familiar with this 
theory since its first appearanc^l, kept an active interest in it even after 
the formulations of Quantum Mechanics and early Quantum Electrodynam- 
ics, which made it unquestionable for him that the problem of matter could 
not be adequately addressed in the framework of a classical field theory, 
unlike Einstein, who maintained such a hope in various forms until the end 
of his life in 1955. 

It is therefore remarkable that in 1943 Einstein and Pauli wrote a paper 
in which they proved the non-existence of such solutions. The introduction 
contains the following statement: 

When one tries to find a unified theory of the gravitational and 
electromagnetic fields, he cannot help feeling that there is some 
truth in Kaluza's five- dimensional theory. P- 131) 

46 It came out too late to be considered in the first edition of Pauli's Relativity article. But 
he devoted to it a comparatively large space in his Supplementary Notes written in early 
1956 for the first English edition ( 54 , Suppl. Note 23, pp. 227-232; [55], pp. 276-282) 
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In fact, Einstein and Pauli offered a proof for the more general situation with 
an arbitrary number of additional space dimensions, fulfilling the generalised 
Kaluza-Klein "cylinder-condition" that the gravitational field should not de- 
pend on any of these extra directions. Note that this extra condition intro- 
duces non-dynamical background structures, so that of the 5-dimensional 
diffeomorphism group only those diffeomorphisms preserving this condition 
can act as symmetries, a point Pauli often emphasised as a deficiency re- 
garding the Kaluza-Klein approach. 

Restricting attention to five dimensions, the explicitly stated hypotheses 
underlying the proof were these ([14j. p. 131; annotations in square brackets 
within quotations are mine): 

HI "The field is stationary (i.e the gik [the five-dimensional metric] are 
independent of x 4 [the time coordinate])." Clearly, g^ is also assumed 
to be independent of the fifth coordinate x 5 . 

H2 "It [the field gtic] is free from singularities." 

H3 "It is imbedded in a Euclidean space (of the Minkowski type), and for 
large values of r (r being the distance from the origin of the spatial 
coordinate system) g44 has the asymptotic form g44 = —1 +|x/r, where 
[L ^ 0." The last condition is meant to assure the non-triviality of the 
solution, i.e. that there really is an attracting object at the spatial 
origin. This becomes clear if one recalls that in the lowest weak-field 
and slow- motion approximation 1 + just corresponds to the Newto- 
nian gravitational potential. Unfortunately, the other statement: "It 
is imbedded in a Euclidean space (of the Minkowski type)" seems am- 
biguous, since the solution is clearly not meant to be just (a portion of) 
5-dimensional flat Minkowski space. Hence the next closest reading is 
presumably that the underlying five-dimensional spacetime manifold 
is (diffeomorphic to) R 5 , with some non-flat metric of Minkowskian 



The elegant method of proof makes essential use of the fact that the suit- 
ably restricted group of spacetime diffeomorphisms (to those preserving the 
cylinder condition) is a symmetry group for the full set of equations in the 
sense of ([3]) of Definition[TJ More precisely, two types of diffeomorphisms 
from that class are considered separately by Einstein and Pauli: 

Dl Arbitrary ones in the three coordinates (x 1 ,x 2 ,x 3 ) which leave invari- 
ant the (x 4 ,x 5 ) coordinates. 

D2 Linear ones in the (x 4 , x 5 ) coordinates, leaving invariant the 



In fact, it turns out that formally the proof does not depend on whether the fifth 
dimension is space- or time-like, as noted by Einstein and Pauli ([H], p. 134). 




(x\x 2 ,x 3 ). 
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Now, as a matter of fact, this innocent looking split introduces a fur- 
ther and, as it turns out, crucial restriction, over and above the hypotheses 
H1-H3. The point is that the split and, in particular, the set D2 of dif- 
feomorphisms simply do not exist unless the spacetime manifold, which in 
H3 was assumed to be R 5 , globally splits into R x R 3 such that the first 
factor, R 2 , corresponds to the x 4 x 5 -planes of constant spatial coordinates 
(x 1 ,x 2 ,x 3 ) and the second factor, R 3 , corresponds to the x 1 x 2 x 3 -spaces of 
constant coordinates (x 4 ,x 5 ). But this need not be the case if H1-H3 are 
assumed. The identity derived by Einstein and Pauli from the requirement 
that transformations of the field induced by diffeomorphisms of the type D2 
are symmetries are absolutely crucial in proving the non-existence of regular 
solutions! 4 ^! 

We now know that this additional restriction is essential to the non- 
existence result: There do exist solutions of the type envisaged that satisfy 
H1-H3, but violate the extra (and superfluous) splitting conditional They 
are called Kaluza-Klein Monopoles |65j [31] and carry a gravitational mass 
as well as a magnetic charge. It is hard to believe that Pauli as well as 
Einstein would not have been much impressed by those solutions, though 
possibly with different conclusions, had they ever learned about them. It is 
also conceivable that these solutions could have been found at the time, had 
real attempts been made, rather than — possibly — discouraged by Pauli's and 
Einstein's result. In fact, Kurt Godel, who was already in Princeton when 
Pauli visited Einstein, found his famous cosmological solution [26] in 1949 
by a very similar geometric insight that also first led to the Kaluza-Klein 
monopole [65jF^l 

3.7 Irritations and psychological prejudices 

One of Pauli's major interests were discrete symmetries, in particular the 
transformation of space inversion, x i— > — x, also called parity transforma- 
tion. Given a linear wave equation which is symmetric under the proper 
orthochronous (i.e. including no space and time inversions) Poincare group, 

48 Specifically we mean their identity (13), which together with spatial regularity implies 
the integral form (13a), which in turn leads directly to vanishing mass in (22-23a). (All 
references are to their formulae in [14].) 

49 The somewhat intricate topology of the Kaluza-Klein spacetime is this: The x 5 coordi- 
nate parametrises circles which combine with the 2-spheres (polar coordinates (0,(p)) 
of constant spatial radius, r, into 3-spheres (Hopf fibration) which are parametrised by 
(8, cp,x 5 ), now thought of as Euler angles. The radii of these 3-spheres appropriately 
shrink to zero as r tends to zero, so that (r, 9, cp,x 5 ) define, in fact, polar coordinates 
of R 4 . Together with time, x 4 , we get R 5 as global topology. Now the submanifolds of 
constant (x 1 ,x 2 ,x 3 ) are those of constant (r, 8,cp) and have a topology R x S' rather 
than R 2 , so that the linear transformations D2 in the x 4 x 5 coordinates do not define 
diffeomorphisms of the Kaluza-Klein spacetime manifold. 

50 Both use invariant metrics on 3-dimensional group manifolds, SU(2) in the KK case, 
SU(1 , 1 ) in Godels case. This simplifies the calculations considerably. 
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one may ask whether it is also symmetric under space and time inversions. 
For this to be a well defined question one has to formulate conditions on 
how these inversions interact with Poincare transformations. Let us focus 
on the operation of space inversion. If this operation is implementable by 
an operator V, it must conjugate each rotation and each time translation 
to their respective self, and each boost and each space translation to their 
respective inverse. This follows simply from the geometric meaning of space 
inversion. Hence, generally speaking, we need to distinguish the following 
three possible scenarios (recall the notation from Section [2~2"j) : 

(a) V acts on fC and is a symmetry, i.e. leaves 2?£ C K invariant; 

(b) V acts on fC and is no symmetry, i.e. leaves T>z C K, not invariant; 

(c) V is not implementable on /C. 

It is clear that when one states that a certain equation is not symmetric 
under V one usually addresses situation (b), though situation (c) also occurs, 
as we shall see. 

Consider now the field of a massless spin--^ particle, that transforms 
irreducibly under the proper orthochronous Poincare group. The field is 
then either a two-component spinor cb A , which in the absence of interactions 
obeys the so-called Weyl equation 51 ! 

3aa'* A = 0. (32) 

Alternatively, one may also start from a four-component Dirac spinor, 

♦-(£) 

which carries a reducible representation of the proper orthochronous 
Poincare group: If cb A transforms with A 6 SL(2,C) then Xa' transforms 
with (A^)~ 1 (being an element of the complex-conjugate dual space), so 
that the space of the upper two components cb A of 4> and the space of the 
lower two components Xa' °f ^ are separately invariant. One may then 
eliminate two of the four components by the so-called Majorana condition, 
which requires the state i\> to be identical to its charge-conjugate, ib c , where 



C : i|> - V :=iy 2 iK = [£) . (31) 



a 



51 Here I use the standard Spinor notation where upper-case capital Latin indices refer to 
(components of) elements in spinor space (2-dimensional complex vector space), lower 
case indices to the dual space, and primed indices to the respective complex-conjugate 
spaces. Indices are raised and lowered by using a (unique up to scale) SL(2,C) invariant 
2-form. An overbar denotes the map into the complex-conjugate vector space. Unless 
stated otherwise, my conventions are those of |64] , 
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Hence for a Majorana spinor one has (b = % and the interaction-free Dirac 
equation reads 

One can now either regard (I32p or (I35p as the interaction-free equation for 
a neutrino. 

Here I wish to briefly recall a curious discussion between Pauli and Fierz 
on whether or not these two equations describe physically different state of 
affairs. Superficially this discussion is about a formal and, mathematically 
speaking, rather trivial point. But, as we will see, it relates to deep-lying 
preconceptions in Pauli's thinking about issues of symmetry. This makes it 
worth looking at this episode in some detail. 

First note that there is an obvious bijection, |3, between two-component 
spinors and Majorana spinors, given by 

P (36, 

Note also that the set of Majorana spinors is a priori a reajf! vector space, 
though it has a complex structure, j, given by 

KIXX). 

with respect to which the bijection (|36p satisfies |3 o i = ) o |3, where here 
i stands for the standard complex structure (multiplication with imaginary 
unit i) in the space C of two-component spinors. However, regarded as a 
map between complex vector spaces, the bijection (3 is not linear. 

Now, Pauli observed already in 1933 (see quotation below) that the Weyl 
equation (|32|) is not symmetric under parity. Hence he concluded it could 
not be used to describe Nature. In fact, what is actually the case is that 
parity cannot even be implemented as a linear map on the space of two- 
component spinors (case (c) above). This is easy to see and in fact true for 
any irreducible representation of the Lorentz group that stays irreducible if 
restricted to the rotation group (i.e. for purely primed or purely unprimed 
spinors) 1^1 

52 The reality structure on the complex vector space of Dirac spinors is provided by the 
charge conjugation map. 

53 As stated above, the geometric meaning of space inversion requires that the parity 
operator (if existent) commutes with spatial rotations and conjugates boosts to their 
inverse. The first requirement implies (via Schur's Lemma) that it must be a multiple 
of the identity in any irreducible representation that stays irreducible when restricted to 
the rotation subgroup, which contradicts the second requirement. Hence it cannot exist 
in such representations, which are precisely those with only unprimed or only primed 
indices. 
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On the other hand, the Dirac equation is symmetric under space inver- 
sions. Indeed, the spinor-map corresponding to the inversion in the spatial 
plane perpendicular to the timelike normal n is given by 

P:^^^v .-^n^^opn), (38) 

where p n : H — x^ + 2u^(rt v x A ') and where u is a complex number of unit 
modulus, called the intrinsic parity of the particular field ib. It is easy to 
see that V is a symmetry of (j35j) for any r\. Note that V 2 = r\ 1 so that 
T) E {1,-1, i, — i}, since for spinors one only requires V 2 = ±1 (rather than 
V 2 = 1). It is also easy to verify that V commutes with C iff r\ = ±i. So if 
we assign imaginary parity to the Majorana fields, the operator V also acts 
on the subspace of Majorana spinors. We conclude that the free Majorana 
equation is parity invariant. 

Hence it seems at first that the Weyl formulation and the Majorana 
formulation differ since they have different symmetry properties. But this 
is not true. Using the bijection (I36p . we can pull-back the parity map (I38p 
to the space of two-component spinors, where it becomes (now either n = i 
or n = — i) 

ch A ^ riV2u AA '(^ A ,op n ), (39) 

which is now an anti-linear map on the space of two-component spinors. 

All this was essentially pointed out to Pauli by Markus Fierz in a letter 
dated February 6th 1957 ([45J Vol. IV, Part IV A, Doc. [2494], p. 171) in con- 
nection with Lee's and Yang's two-component theory of the neutrino. Fierz 
correctly concluded from this essential equivalence 55 ! that the 2-component 
theory as such (i.e. without interactions) did not warrant the conclusion of 
parity violation; only interactions could be held responsible for that. 

This was a relevant point in the theoretical discussion at the time, as can 
be seen from the fact that there were two independent papers published in 
The Physical Review shortly after Fierz's private letter to Pauli, containing 
the very same observation. The first paper was submitted on February 13th 
by McLennan [33], the second on March 25th by Case [9]. In fact, Serpe 
made this observation already in 1952 [52] and emphasised it once more in 
1957 [63]. 

One might be worried about the anti-linearity of the transformation in 
(I39p . In that respect, also following Fierz, an illuminating analogy may be 
mentioned regarding the vacuum Maxwell equations, which can be written 
in the form 

I3 t 6 - V x ® = , V • 6 = , (40) 

where 

6 := E + IB (41) 

54 Which is also the standard choice in QFT; see e.g. [69], pp. 126,226. 

55 Meaning the existence of a bijection that maps all quantities of interest (states, currents, 
symmetries) of one theory to the other. 
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is a complex combination of the electric and magnetic field. Both equations 
(|40p are clearly equivalent of the full set of Maxwell's equations. It can 
be shown that spatial inversions cannot be implemented as complex-linear 
transformations on the complex- valued field But, clearly, we know 

that Maxwell's equations are parity invariant, namely if we transform the 
electric field as E H — E o p ('polar' vector-field) and the magnetic field as 
Bh Bop ('axial' vector- field) , where p : (t,x) •— > (t, — x). This corresponds 

to an antilinear symmetry of (|4Qj) . given by <J> i— > — O o p. 

Coming back to Fierz's (and other's) original observation for the spinor 
field, they were accepted without much ado by others. For example, in her 
survey on the neutrino in the Pauli Memorial Volume, Madame Wu states 
that "It is the interaction and the interaction only that violates parity" ( [18] , 
footnote on p. 270.). In note 25c of that paper she explicitly thanks Fierz 
for "enlightening discussions" on the two-component theory of the neutrino. 
Clearly Fierz expected his observation to be of interest to Pauli, who had 
already in the 1933 first edition of his handbook article on wave mechanics 
propagated the view that Weyl's two-component equations ard^l 

...not invariant under reflections (interchange of left and right) 
and, as a consequence, not applicable to the physical reality. 

But instead, Pauli reacts with a surprising plethora of ridiculing remarks 

Dear Mr. Fierz! Your letter from the 6th is the biggest blunder 
you ever commited in your life! (Probably this afternoon you 
will send a correction). Have only read the first paragraph of 
your letter which originated in the asylum and was shaking with 
laughter. [...] When this letter arrives (yours I will frame!) you 
probably will already know everything. 

Personal irritations emerged which lasted about one week through several 
exchanges of letters and a phone-call. Finally Pauli essentially conceded 

56 Equations (|40|l are equivalent to 3 AA f ab = 0, where f ab is the unprimed spinor 
equivalent of the tensor for the electromagnetic field strength. Parity cannot be 
linearly implemented on this purely unprimed spinor, for reasons already explained in 
footnote l53l 

57 German original, full sentence: "Indessen sind diese Wellengleichungen, wie ja aus ihrer 
Herleitung hervorgeht, nicht invariant gegeniiber Spiegelungen (Vertauschung von links 
und rechts) und infolge dessen sind sie auf die physikalische Wirklichkeit nicht anwend- 
bar" ([56], p. 234, note 54). The conclusion concerning non applicability to the physical 
reality is cancelled in the 1958 edition; cf. [55], p. 150. 

58 German original: "Lieber Herr Fierz! Ihr Brief vom 6. ist der grofite Bock den Sie 
im Laufe Ihres Lebens geschossen haben! ( Wahrscheinlich kommt heute Nachmittag 
schon eine Berichtigung von Ihnen.) Habe nur den ersten Absatz Ihres der Anstalt 
entsprungenen Briefes gelesen und mich geschiittelt vor lachen. [...] Wenn dieser Brief 
ankommt (Ihren rahme ich ein!), wissen Sie wohl schon alles!" ( 45 , Vol. IV, Part IV A, 
Doc. [2497], p. 179). 
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Fierz's point in a long letter of February 12th 1957 that also contains first 
hints at Pauli's psychological resistances (the emphasis is Pauli's) 1^1 



Your presentation creates in me a feeling of "formal boredom", 
to which the fusillade of laughter was of a compensatory nature. 

This is a curious episode and not easy to understand. Pauli's point seems to 
have been that he wanted to maintain the particle-antiparticle distinction 
independently of parity, whereas Fierz pointed out that the two-component 
theory provided no corresponding structural element: In Weyl's form the 
operations C and V simply do not exist separately, in the Majorana form 
V exists and C is the identity (hence not distinguishing). Psychologically 
speaking, Pauli's point becomes perhaps more understandable if one takes 
into account the fact that since the fall of 1956 he was thinking about the 
question of lepton-charge conservation. Intuitively he had therefore taken 
as self-evident that opposite helicities also corresponded to the particle- 
antiparticle duality (cf. [45J, Vol. IV, Part IV A, Doc. [2497]), even though 
this mental association did not correspond to anything in the equations. In 
a letter dated February 15th 1957 he offered the following in-depth psycho- 
logical explanation to Fierz (the emphases are Pauli's) r°l 

Well, the fusillade of laughter occurred with the expression "Ma- 
jorana Theory" of your first letter. After this catchword I could 
not go on reading. The immediate association with Majorana 
clearly has been this: "aha, particles and antiparticles should no 
longer exist, these one intends to take away from me (as one 
takes away a symbol from somebody)!" This causes me anxiety. 
I also know that since last fall the conservation of lepton charge 

59 German Original: "Ihre Darstellung erzeugt bei mir das Gefiihl der 'formalistischen 
Langeweile", zu der die Lachsalve kompensatorisch war" ([45], Vol. IV, Part IV A, 
Doc. [2510], p. 197). 

60 German original: "Also die 'Lachsalve' erfolgte beim Wort 'Majorana Theorie' Ihres 
ersten Briefes, ich konnte nach diesem Stichwort nicht mehr weiterlesen. Die unmittel- 
bare Assoziation zu Majorana war natiirlich 'aha, Teilchen und Antiteilchen soli es nicht 
mehr geben, die will man wir wegnehmen (wie man jemandem ein Symbol wegnimmt)!' 
Davor habe ich Angst. Ich weiss auch, dafi mir schon seit Herbst die Erhaltung der 
Leptonladung in der Physik ungeheuer wichtig ist - rational betrachtet, vielleicht zu 
wichtig. Ich habe Angst, sie konnte sich als unrichtig herausstellen und, psychologisch 
gesehen, ist 'Unzufriedenheit' ein Euphemismus fur Angst. Die CP - (= Majorana P+ 
Vertauschung von Elektron und Positron) Invarianz ist mir auch wichtig, aber weniger 
wichtig als die Erhaltung der Leptonladung. Es ist sicher wahr, dafi 'mein platonischer 
Spiegelkomplex angestochen' war. Teilchen und Antiteilchen sind das Symbol fur jene 
allgemeine Spiegelung (wie weit sie speziell platonisch ist, dessen bin ich nicht sicher). 
[...] Offenbar hat der 'Spiegelungskomplex' bei mir etwas mit Tod und Unsterblichkeit 
zu tun. Daher die Angst! Ware die Beziehung zwischen dem schlafenden Spiegelbild 
und dem Wachenden gestort, oder waxen sie gar identisch (Majorana), so gabe es, 
psychologisch gesprochen, weder Leben (Geburt) noch Tod." ([45], Vol. IV, Part IV A, 
Doc. [2517], p. 225) 
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in physics was tremendously important to me — looked upon ra- 
tionally probably too important. / am anxious it could turn out 
to be incorrect and, psychologically speaking, "discontentedness" 
is a euphemism for anxiety. The CP (= Majorana P+ exchange 
between electron and positron) invariance is also important to 
me, but less so than the conservation of lepton charge. It is 
certainly true that it "hit upon my Platonic mirror complex". 
Particles and antiparticles are the symbol for that more general 
mirroring (I am not sure to what extent it is particularly pla- 
tonicj. /..._/ Mirroring is also a gnostic symbol for life and death. 
There light is extinguished at birth and lightened up at death. 
[. .] Obviously, for me the "mirroring complex" has something to 
do with death and immortality. Hence the anxiety! If the rela- 
tion between the sleeping mirror image and the one awake would 
be disturbed, or if they would even be identical (Majorana), then, 
psychologically speaking, there would neither be life (birth) nor 
death. 

Fierz later commented on that episode in a personal letter to Norbert Strau- 
mann, parts of which are quoted in [67J. 

3.8 (3-Decay and related issues 
3.8.1 CPT 

In 1955 a collection of essays by distinguished physicists appeared to cele- 
brate Niels Bohr's 70th birthdays [52]. Pauli's contribution ([52], p. 30-51) is 
entitled "Exclusion Principle, Lorentz Group and Reflection of Space-Time 
and Charge" , whose introduction contains the following remarks: 

After a brief period of spiritual and human confusion, caused by 
provisional restriction to "Anschaulichkeit" , a general agreement 
was reached following the substitution of abstract mathematical 
symbols, as for instance psi, for concrete pictures. Especially the 
concrete picture of rotation has been replaced by mathematical 
characteristics of the representations of the group of rotations 
in three dimensional space. This group was soon amplified to 
the Lorentz group in the work of Dirac. [...] The mathemat- 
ical group was further amplified by including the reflections of 
space and time. [...] I believe that this paper also illustrates the 
fact that a rigorous mathematical formalism and epistemological 
analysis are both indispensable in physics in a complementary 
way in the sense of Niels Bohr. While I try to use the former to 
connect all mentioned features of the theory with help of a richer 
"fullness" of plus and minus signs in an increasing "clarity", the 
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latter makes me aware that the final "truth" on the subject is still 
"dwelling in the abyss"^ ([52], p. 30-31) 

In some sense this paper of Pauli's can be seen as a follow-up to his spin- 
statistics paper already discussed above, the main difference being that Pauli 
now considers interacting fields. Pauli now assumes (1) the validity of the 
spin-statistics correlation for interacting fields (for which there was no proof 
at the time), (2) invariance under (the universal cover of) the proper or- 
thochronous Lorentz group SL(2,C) (as in the spin-statistics paper), and 
(3) locality of the interactions (i.e. involving only finitely many derivatives). 
Then Pauli shows that this suffices to derive the so-called CPT theorem that 
states that the combination of charge conjugation (C) and spacetime reflec- 
tion (PT) is a symmetry!^! 

At the time (1955) Pauli wrote his paper it was not known whether any 
of the operations of C, P, or T would separately not be a symmetry. This 
changed when in January 1957 through the experiments of Madame Wu et 
al, in which explicit violations of P and C were seen in processes of beta- 
decay, following a suggestion that this should be checked by Lee and Yang 
in mid 1956 [ID]. Pauli had still offered a bet that this would not happen 
on January 17th 1957 (the emphases are Pauli's) @ 

/ do not believe that God is a weak left-hander and would be 
prepared to bet a high amount that the experiment will show a 
symmetric angular distribution of the electrons ( mirror symme- 
try). For I cannot see a logical connection between the strength 
of an interaction and its mirror symmetry. 

In view of this firm belief in symmetry the following is remarkable: In his 
CPT paper Pauli takes great care to writes down the most general ultralo- 
cal (i.e. no derivatives) four-fermion interaction (for the neutron, proton, 
electron and neutrino), which is not P invariant. In contains 10 essentially 
different terms with ten coupling constants Ci , • • • Cio, only the first five of 
which are parity invariant (scalars), whereas the other five are pseudoscalars, 
i.e change sign under spatial inversions. Apparently this he did just for the 

61 Here Pauli sets the following footnote: "I refer here to Bohr's favourite verses of Schiller: 
'Nur die Fiille fiihrt zur Klarheit / Und im Abgrund wohnt die Wahrheit"'. 

62 Pauli used a now outdated terminology: instead of CPT he uses SR (strong reflection) , 
instead of PT he uses WR (weak reflection) , and instead of C he uses AC (antiparticle 
conjugation). Preliminary versions of the CPT theorem appeared in papers by Julian 
Schwinger (1951)and Gerhard Liiders (1954) to which Pauli refers. Two years after 
Pauli's 1955 paper Res Jost gave a very elegant proof in the framework of axiomatic 
quantum field theory [34] . 

63 German original: "Ich glaube aber nicht, dafi der Herrgott ein schwacher Linkshdnder 
ist und ware bereit hoch zu wetten, dafi das Experiment symmetrische Winkelverteilung 
der Elektronen (Spiegelinvarianz) ergeben wird. Denn ich sehe keine logische 
Verbindung von Starke einer Wechselwirkung und ihrer Spiegelinvarianz." ([45], Vol. IV, 
Part IV A, Doc. [2455] , p. 82) 
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sake of mathematical generality without any physical motivation, as he ex- 
plicitly stated in a letter to Madame Wu dated January 19th 1957 (the 
emphases are Pauli's): 

When I considered such formal possibilities in my paper in the 
Bohr-Festival Volume (1955), I did not think that this could have 
something to do with Nature. I considered it merely as a mathe- 
matical play, and, as a matter of fact, I did not believe in it when 
I read the paper of Yang and Lee. [...] What prevented me until 
now from accepting this formal possibility is the question why this 
restriction of mirroring appears only in the 'weak' interactions, 
not in the strong ones. Theoretically, / do not see any interpre- 
tation of this fact, which is empirically so well established. ( [45] . 
Vol. IV, Part IV A, Doc. [2460], p. 89) 

Lee and Yang took this possibility more serious: In an appendix to their 
paper they also write down all ten terms for the full, parity non-invariant 
interaction (|4(Jj. p. 258), without any citation of Pauli. 

Pauli first learnt that the experiments by Madame Wu et al. had led 
to an asymmetric angular distribution from a letter by John Blatt from 
Princeton, dated January 15th 1957. There Blatt writes: 

/ don't know whether anyone has written you as yet about the 
sudden death of parity. Miss Wu has done an experiment with 
beta-decay of oriented Co nuclei which shows that parity is not 
conserved in (3 decay. [...] We are all rather shaken by by the 
death of our well-beloved friend, parity. ([15], Vol. IV, Part IV A, 
Doc. [2451], p. 74) 

Pauli, too, was shocked as he stated in his famous letter to Weisskopf dated 
January 27/28 1957 (05], Vol. IV, Part IV A, Doc. [2476]). In that very same 
letter Pauli already started speculating how symmetry could be restored by 
letting the constants Ci become dynamical field, scalar fields for i = 1 , • • • ,5 
and pseudo-scalar ones for i = 6, • • • . loF^l 

Let us imagine, for example, the terms with Ci,--- , C5 being 
multiplied with a scalar field $>{x), the terms ■ ■ ■ , C10 mul- 
tiplied with a pseudo-scalar field q>(x). For God Himself, Who 
can change the sign of ct)(x) ; such a theory would be left-right- 
invariant — not for us mortal men, however, who do not know 

64 German original: "Denken wir uns z.B. die Terme mit Ci , ■ ■ ■ , C5 mit einem Skalarfeld 
4)(x), die Terme mit C6, ■ ■ ■ , C10 mit einem Pseudo-Skalarfeld multipliziert. Fur 

den Herrgott, der das Vorzeichen von $(x) umdrehen kann, ware eine solche Theorie 
natiirlich rechts-links-invariant - nicht aber fur uns sterbliche Menschen, die wir gar 
nichts wissen iiber jenes hypothetische neue Feld, aufier dafi es praktisch auf der Erde 
raum-zeitlich konstant (statisch-homogen) ist, und die wir noch kein Mittel haben, es 
zu andern." (gS], Vol. IV, Part IV A, Doc. [2476], pp. 122-123) 



43 



anything about that new hypothetical field, except that it is prac- 
tically constant in space and time on earth (static-homogeneous) , 
and that we do not ye^l have any means to change it. 

The mechanism envisaged here to restore symmetry is just that discussed in 
Section[2?2j where non-dynamical backgrounds structures, L, are (formally) 
turned into dynamical quantities, O. 

3.8.2 The Pauli group 

As already mentioned, since fall of 1956 Pauli's thinking about beta-decay 
was dominated by the lepton-charge conservation. In a paper submitted 
on March 14th 1957, entitled On the Conservation of Lepton Charge ([38], 
Vol.2, pp. 1338-1349), Pauli once more showed his mastery of symmetry 
considerations while keeping everything at the largest possible degree of 
generality. 

He starts by considering the most general ultralocal four-fermion inter- 
actions (not necessarily preserving parity or lepton charge) in which the 
neutrino field is represented by a Dirac 4-spinor, ib. For what follows it is 
convenient to think of the four components of i[> as comprising the following 
four particle states (per momentum): a left-handed neutrino, 4>l, a right- 
handed neutrino, i!>r, and their antiparticles 4>£ and i[>^ respectively. Note 
that this means 4>£ R := (i^l.r) and that accordingly ib£ is right- and i[>^ is 
left-handed. Here we follow the convention of [35]. 

Next Pauli considers a four-parameter group of canonical transforma- 
tions (i.e. they leave the anticommutation relations between the fermion 
fields invariant) of the neutrino field, henceforth called the Pauli group, 
whose interpretation will be given below. These transformations define a 
symmetry of the interaction-free equations of motions (assuming a massless 
neutrino throughout), but will generally not define a symmetry once the 
interaction is taken into account. Rather, the following is true (cf. [48J): 
Suppose that the general interaction depends on a finite number of coupling 
constants Ci for i = 1 , • • • , u and that the equations of motion follow from 
an action principle with Lagrange density L { L | O }, where L represents the 
array of coupling constants (we notationally ignore other non-dynamical 
structures here) and <D the dynamical fields. Then the Pauli group acts as 
covariance in a slightly stronger sense than ([5]), namely so that 

L{g-I|g-(D} = L{I|0}. (42) 

This means that on the level of the Lagrange density (or the Hamiltonian), 
and hence in particular at the level of the equations of motion, the transfor- 
mation of the dynamical fields can be compensated for by a transformation 

65 The "yet" is incorrectly omitted in the official translation ([45], Vol. IV, Part IV A, 
p. 126). 
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of the coupling constants. A large part of Pauli's paper is actually devoted 
to the determination of that compensating action of the Pauli group on the 
array of coupling constants. 

Next suppose the initial state is chosen to be invariant under the Pauli 
group, i.e. g <D = ® for all g. Then (|42f) implies that its evolution with 
interaction parametrised by L (the array of Ci's) is identical to the evolution 
parametrised by g -L for any g. Hence the outcome of the evolution can only 
depend on the c-l's through their Pauli-invariant combinations^ In partic- 
ular, since the neutrinoless double beta-decay simply has no initial neu- 
trino, this reasoning can be applied to it. If this lepton-charge-conservation 
violating process is deemed impossible, the corresponding Pauli-invariant 
combination of coupling constants to which the scattering probability is 
proportional must vanish. This, in turn, gives the sought-after constraint 
on the possible four-fermion interaction. For (massless) neutrinos in Majo- 
rana representation Pauli finally arrived at the result that either only the 
left- or the right-handed component enters the interaction. It should be 
added that this clever sort of reasoning was shortly before used by Pursey 
in a less general setting [61] in which the interaction was specialised a pri- 
ori to conserve lepton charged More on the history of the search for the 
right form of the four-fermion interaction may be found in [57]. It should 
also be mentioned that the possibility of neutrinoless double beta-decays is 
currently still under active experimental investigation at the National Gran 
Sasso Laboratory, where the 2003-2005 CUORICINO experiment set upper 
bounds for the Majorana mass of the electron neutrino well below one eV. 
The upcoming next-generation experiment, CUORE, is designed to lower 
this bound to 5 • 1 0~ 2 meV; compare [30J . 

What is the interpretation of the Pauli group? Mathematically it is 
isomorphic to U(2), the group of 2 x 2 unitary matrices acting on a two- 
dimensional complex vector space. Here there are two such spaces (per 4- 
momentum) in which it acts: the 'left-handed subspace' that is spanned by 
the two left-handed components ibi_ and ib^, and the 'right-handed subspace' 
that is spanned by the two right-handed components 4>r and il>£. The two 

66 For illustrative purposes we argue here as if all fields were classical and obeyed classical 
equations of motion, though Pauli clearly considers the quantum theory where the 
fields become operators. The principal argument is the same, though what makes a big 
difference between the classical and the quantum case is that in the latter we can more 
easily ascertain the existence of invariant initial states. This is because in quantum 
theory, assuming there are no superselection rules at work, the superposition principle 
always allows us to construct invariant initial states by group-averaging any given state 
over the group (which is here compact, so that the averaging is unambiguously defined). 
Such states would, for example, appropriately represent physical situations where those 
observables that distinguish between the states in the group orbit are not measured, 
may it be for reasons of practice or of principle. 

67 It will be a quadratic combination in leading order of perturbation theory. Explicit 
calculations had been done by Pauli's assistant Charles Enz [15] . 

68 In terms of the Pauli group, Pursey did not consider the U(1 ) part. 
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actions of U(2) in these spaces are complex conjugate to each other (see 
equation (|43p ). Usually one thinks of the Pauli group as U(1) x SU(2), 
which is a double cover of U(2), so that the four real parameters are written 
as a phase exp(ia), parametrising U(1), and two complex parameters a, b 
satisfying |a| 2 + |b| 2 = 1, which give three real parameters when split into 
real and imaginary part and which parametrise a 3-sphere that underlies 
SU(2) as group manifold. In this parametrisation the action of the Pauli 
group reads (an asterisk stands for complex conjug ation)@ 



Invariance under the Pauli group is now seen to correspond to an am- 
biguity in the particle-antiparticle distinction. This ambiguity would only 
be lifted by interactions that allowed to distinguish the two left and the 
two right states respectively. In the absence of such interactions the various 
definitions of 'particle' and 'antiparticle' are physically indistinguishable, so 
that the Pauli group acts by gauge symmetries in the sense of Section [2T3l 

Also, the different presentations of the two-component theory, already 
discussed in Section [377l can be seen here. The Majorana condition reads 
il> = 4> c , which in terms of the four components introduced above leads to 

= 4>r and \J)r = This can be read in two different ways, depending 
on whether one addresses ^l^l or t I ; l>4'r as independent basic states. In 
the first case one would say that there is a left-handed neutrino and its 
right-handed antiparticle, whereas in the second case one regards the tuple 
(i^Li^r) as respectively the left- and right-handed components of a single 
particle which is identical to its antiparticle. 

Beyond weak interaction and beta-decay, the Pauli group played a very 
important role in Pauli's brief participation in Heisenberg's programme for 
a unified field theory. It was Pauli who first showed that the (so far classi- 
cal) non-linear spinor equation proposed by Heisenberg was invariant under 
the Pauli group (cf. Heisenberg's account in his letter to Zimmermann from 
Jan. 7th 1958 in [25], Vol. IV, Part IV B, p. 779). In this new context the U(1 ) 
part of the Pauli group was connected to conservation of baryon charge and 
the SU(2) part acquired the meaning of isospin symmetryl 70 ! The central 

69 Usually the Pauli group is written in terms of the 4-component neutrino field i|) as 
4> I— > expfiocYsXcnl) + by5i|) c ), where ij) c := iy2'4'* is the charge conjugate field. But 
this is easily seen to be equivalent to (|43[) if one sets 4>l,r = i(1 ±Ys)4> an( A A>r l — 
2O ± Y5)il) c . The more explicit form (|43[l is better suited for the interpretational 
discussion; cf. [5S]. The two-to-one homomorphism from U(1) x SU(2) to U(2) is given 
by (exp(ioc) , A) 1— > exp(ia)A whose kernel is {(1,1) , ( — 1,-1)}. 

70 The non-linear spinor equation was at that stage not designed to include weak interac- 
tion. 




(43b) 
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importance of isospin for this programme may already be inferred from the 
title of the proposed common publication by Heisenberg and Pauli, which 
reads: On the Isospin Group in the Theory of Elementary particles. How- 
ever, due to Pauli's later retreat from this programme, the manuscript (cf. 
[IS] , Vol. IV, Part IV B, pp. 849-861) for this publication never grew beyond 
the stage of a preprint. 



3.8.3 Cosmological speculations 

In his last paper on the subject of discrete symmetries, entitled The Vio- 
lation of Mirror- Symmetries in the Laws of Atomic Physic^ Q38J, Vol.2, 
pp. 1368-1372), Pauli comes back to the question which bothered him most: 
How is the strength of an interaction related to its symmetry properties? 
He says that having established a violation of C and P symmetry for weak 
interactions, we may ask why they are maintained for strong and electro- 
magnetic interactions, and whether the reason for this is to be found in 
particular properties of these interactions. He ends with some speculations 
on possible connections between violations of C and P symmetry in the laws 
of microphysics on one hand, and properties of theories of gravitation and 
its cosmological solutions on the other! 72 ! 

Second, one can try to find and justify a connection between 
symmetry violation in the small with properties of the Universe 
in the large. But this exceeds the capabilities of the presently 
known theory of gravity. [. . .} New ideas are missing to go be- 
yond vague speculations. But this shall not be taken as definite 
expression for the impossibility of such a connection. 

It may be of interest to contrast this expression of a certain open-mindedness 
for speculations concerning the physics of elementary particles on one side 
and large-scale cosmology on the other, with a more critical attitude from 
Pauli's very early writings. In Section 65 of his Relativity article, where 
Pauli discussed Weyl's attempt for a unifying theory of gravity and electro- 
magnetism (to which Pauli himself actively contributed), he observes that 
in Weyl's theory (as well as in Einstein's own attempts from that time) it 
is natural to suspect a relation between the size of the electron and the size 



71 German original: "Die Verletzung von Spiegelungs-Symmetrien in den Gesetzen der 
Atomphysik" . 

72 German original: "Zweitens kann man versuchen, einen Zusammenhang der Symme- 
trieverletzungen in Kleinen mit Eigenschaften des Universums im Grossen aufzufinden 
und zu begriinden. Dies iiberschreitet aber die Moglichkeiten der jetzt bekannten The- 
orien der Gravitation. [...] Um bei der Frage des Zusammenhangs zwischen dem 
Kleinen und dem Grossen iiber vage Spekulationen hinauszugelangen, fehlen daher noch 
wesentlich neue Ideen. Hiermit soli jedoch nicht die Unmoglichkeit eines solchen Zusam- 
menhanges bestimmt behauptet werden." ([35], Vol. 2, p. 1371) 
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(mean curvature radius) of the universe. But then he comments somewhat 
dismissively that this might seem somewhat fantasti^^, p. 202). 



4 Conclusion 

I have tried to display some of the aspects of the notion of symmetry in 
the work of Wolfgang Pauli which to me seem sufficiently interesting in 
their own right. In doing this I have drawn freely from Pauli's scientific 
cevre, irrespectively of whether the particular part is commonly regarded 
as established part of present-day scientific knowledge or not. Pauli's faith 
in the explanatory power of symmetry principles clearly shows up in all 
corner of his cevre, but it also appears clearly rooted beyond the limits of 
his science. 

In the editorial epilogue to the monumental collection of Pauli's scientific 
correspondence, Karl von Meyenn reports that many physicists he talked to 
at the outset of his project spoke against the publication of those letters that 
contained ideas which did not stand the test of time ( [45] . Vol. IV, Part IV B, 
p. 1375). Leaving aside that this must clearly sound outrageous to the his- 
torian, it is, in my opinion, also totally misguided as far as the scientific 
endeavour is concerned. Science is not only driven by the urge to know but 
also, and perhaps most importantly, by the urge to understand. No one who 
as ever actively participated in science can deny that. One central aspect 
of scientific understanding, next to offering as many as possible alternative 
and complementary explanations for the actual occurrences in Nature, is to 
comprehend why things could not be different from what they appear to be. 
The insight into a theoretical or an explanatory failure can be as fruitful as 
an experimental failure. What makes Pauli a great scientist, amongst the 
other most obvious reasons, is not that he did not err — such mortals clearly 
do not exist — , but that we can still learn much from where he erred and 
how he erred. In that sense, let me end by the following words from Johann 
Wolfgang von Goethe's Maximen und Reflexionen 1292): 



Wenn weise Manner nicht irrten, miifiten die Narren verzweifeln. 
(If wise men did not err, fools should despair.) 



German original: "...was immerhin etwas phantastisch erscheinen mag." ([SS], p. 249) 
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